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(D) TOPOLOGY: linear 


(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 


ATGCATCCCA 

TGCATCCCGA 

AAACCACGCC 

ATCGCCCGGA 

GCACGAGCAC 

CACTAATAAC 

60 

CCATCTCGCA 

GTCGGAGCAG 

CAGGATGTGG 

CTCCTGCCCG 

CCTGGCTGCT 

CCTCGTCCTG 

120 

GTGGCCAGCA 

ATGGCCTGCC 

AGCAGTCAGA 

GGCCAGTACC 

AATCGCCACG 

TATCATCGAG 

180 

CATCCCACGG 

ATCTGGTCGT 

TAAGAAGAAT 

GAACCCGCCA 

CGCTCAACTG 

CAAAGTGGAG 

240 

GGCAAGCCGG 

AACCCACCAT 

TGAGTGGTTT 

AAGGATGGCG 

AACCCGTCAG 

CACCAACGAA 

300 

AAGAAATCGC 

ACCGCGTCCA 

GTTCAAGGAC 

GGCGCCCTCT 

TCTTTTACAG 

GACAATGCAA 

360 

GGCAAGAAGG 

AGCAGGACGG 

CGGAGAGTAC 

TGGTGCGTGG 

CCAAGAACCG 

AGTGGGCCAG 

420 

GCCGTTAGTC 

GCCATGCCTC 

CCTCCAGATA 

GCTGTTTTGC 

GCGACGATTT 

TCGCGTGGAG 

480 

CCCAAAGACA 

CGCGAGTGGC 

CAAAGGCGAG 

ACGGCTCTGC 

TGGAGTGTGG 

GCCGCCCAAA 

540 

GGCATTCCAG 

AGCCAACGCT 

GATTTGGATA 

AAGGACGGCG 

TTCCCTTGGA 

CG AC C TG AAA 

600 

GCCATGTCGT 

TTGGCGCCAG 

CTCCCGCGTT 

CGAATTGTGG 

ACGGTGGCAA 

CCTGCTGATC 

66 0 

AGCAATGTGG 

AGCCCATTGA 

TGAGGGCAAC 

TACAAGTGCA 

TTGCCCAGAA 

TCTGGTAGGC 

720 

ACCCGCGAGA 

GCAGCTATGC 

CAAGCTGATT 

GTCCAGGTCA 

AACCATACTT 

TATGAAGGAG 

780 

CCCAAGGATC 

AGGTGATGCT 

CTACGGCCAG 

ACAGCCACTT 

TCCACTGCTC 

AGTGGGCGGT 

840 

GATCCGCCGC 

CGAAAGTGTT 

GTGGAAAAAG 

GAGGAGGGCA 

ATATTCCGGT 

GTCCAGAGCG 

900 

CGAATCCTTC 

ACGACGAGAA 

AAGTTTAGAG 

ATATCCAACA 

TAACGCCCAC 

CGATGAGGGC 

960 

ACCTATGTCT 

GCGAGGCACA 

CAACAATGTC 

GGTCAGATCA 

GCGCTAGGGC 

TTCTCTTATA 

1020 

GTCCACGCTC 

CGCCGAACTT 

TACGAAAAGA 

CCCAGTAACA 

AGAAAGTGGG 

ACTAAATGGG 

1080 

GTTGTCCAAC 

T AC C TTG CAT 

GGCCTCCGGA 

AACCCTCCGC 

CGTCTGTATT 

CTGGACCAAG 

1140 

GAAGGAGTAT 

CCACTCTTAT 

GTTCCCAAAT 

AGTTCGCACG 

GAAGGCAGTA 

TGTGGCTGCC 

1200 

GATGGAACTC 

TGCAGATTAC 

GGATGTGCGG 

CAGGAAGACG 

AAGGCTACTA 

TGTGTGTTCC 

1260 

GCTTTCAGTG 

TAGTCGATTC 

CTCTACAGTA 

CGGGTTTTCC 

TGCAAGTCAG 

CTCGGTAGAC 

1320 

GAGCGTCCAC 

CTCCGATTAT 

TCAAATCGGA 

CCTGCCAATC 

AAACACTGCC 

C AAGGG AT C A 

1380 

GTTGCTACTT 

TACCCTGTCG 

GGCCACTGGA 

AATCCCAGTC 

CCCGTATCAA 

GTGGTTCCAC 

1440 

GATGGACATG 

CCGTACAAGC 

GGG CAATCGA 

TACAGCATCA 

TCCAAGGAAG 

CTCACTGAGA 

1500 

GTCGATGACC 

TTCAACTAAG 

TGACTCTGGT 

ACCTACACCT 

GCACTGCATC 

TGGCGAACGA 

1560 

GGAGAAACTT 

CCTGGGCTGC 

CACACTAACG 

GTGGAAAAAC 

CCGGTTCTAC 

ATCTCTTCAC 

1620 

CGGGCAGCTG 

AT C C TAG C AC 

TTATCCTGCT 

CCTCCAGGAA 

CACCTAAAGT 

CCTGAATGTC 

1680 

AGTCGCACrA 

GC ATTAGTCT 

TCGTTGGGCT 

AAAAGCCAAG 

AGAAACCCGG 

AGCTGTGGGC 

174 0 

CCAATCATTG 

GATACACTGT 

AGAGTACTTC 

AGTCCGGATC 

TGCAAACTGG 

TTGGATTGTG 

1800 

GCTGCCCATC 

GAGTCGGCGA 

CACTCAAGTC 

ACTATCTCGG 

GTCTCACTCC 

TGGCACTTCG 

1860 

TATGTGTTCC 

TAGTTAGAGC 

TGAGAATACT 

CAGGGTATTT 

CTGTGCCTTC 

CGGCTTATCA 

1920 

AATGTTATTA 

AAACCATTGA 

GGCAGATTTC 

GATGCAGCTT 

CTGCCAATGA 

TTTGTCAGCA 

1980 

GCTCGAACTT 

TGCTGACAGG 

AAAGTCGGTG 

GAGCTAATAG 

ATGCCTCGGC 

TATCAATGCT 

2040 

AGTGCCGTTA 

GACTTGAGTG 

GATGCTCCAC 

GTGAGCGCTG 

ATGAGAAATA 

CGTAGAGGGC 

2100 
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CTGCGCATAC 

ACTATAAGGA 

TGCCAGTGTA 

CCATCCGCAC 

AGTATCACTC 

GATCACTGTT 

2160 

ATGGATGCCT 

CTGCAGAATC 

GTTTGTGGTG 

GGAAACCTTA 

AGAAGTACAC 

CAAGTATGAG 

2220 

TTCTTCCTAA 

CACCCTTTTT 

TGAGACAATT 

GAAGGACAGC 

CCAGTAACTC 

CAAGACAGCC 

2280 

CTCACCTATG 

AAGATGTTCC 

CTCCGCACCA 

CCGGATAACA 

TTCAGATTGG 

CATGTACAAC 

2340 

CAAACAGCCG 

GTTGGGTGCG 

TTGGACTCCG 

CCACCCTCCC 

AG C AC C AC AA 

TGGCAATTTG 

2400 

TATGGCTACA 

AGATTGAGGT 

CAGCGCCGGT 

AACACCATGA 

AGGTGCTGGC 

CAATATGACT 

2460 

CTTAATGCTA 

CCACCACATC 

TGTGCTCCTA 

AATAACCTAA 

CCACCGGAGC 

TGTGTACAGC 

2520 

GTGAGGTTGA 

ACTCCTTTAC 

CAAGGCAGGA 

GATGGACCTT 

ACTCCAAACC 

GATATCACTA 

2580 

TTCATGGACC 

CCACCCATCA 

TGTGCATCCG 

CCACGGGCAC 

ATCCAAGCGG 

CACCCATGAT 

2640 

GGGCGACATG 

AGGGACAGGA 

TCTCACGTAT 

CATAACAATG 

GCAACATACC 

ACCTGGCGAC 

2700 

ATTAATCCCA 

CCACTCATAA 

AAAGACCACT 

GACTACCTAT 

CTGGAC CGTG 

GCTAATGGTG 

2760 

CTGGTCTGCA 

TCGTTCTTCT 

AGTCCTGGTT 

ATTTCGGCGG 

CTATTTCGAT 

GGTCTACTTC 

2820 

AAGCGCAAGC 

ATCAAATGAC 

CAAGGAATTG 

GGTCACTTAA 

GTGTGGTCAG 

TGACAACGAA 

2880 

ATAAC CGC AT 

TAAATATCAA 

TAG C AAAG AG 

AGCCTTTGGA 

TAG AC CATC A 

TCGTGGATGG 

2940 

CGAACTGCCG 

ATACTGACAA 

AGACTCAGGA 

TTAAGCGAAT 

CGAAGCTACT 

ATCCCACGTT 

3000 

AACAGCAGTC 

AATCCAACTA 

CAATAACTCC 

GATGGAGGAA 

CCGATTATGC 

AGAAGTTGAC 

3060 

ACCCGTAACC 

TTACCACCTT 

CTACAATTGT 

CGCAAGAGCC 

CCGATAATCC 

CACGCCGTAC 

3120 

GCCACCACTA 

TGATCATTGG 

TACCTCTTCC 

AGTGAGACCT 

GCACCAAGAC 

AACATCTATA 

3180 

AGTGCCGATA 

AGGACTCGGG 

AACTCATTCG 

CCCTATTCTG 

ACGCATTTGC 

CGGTCAGGTG 

3240 

CCAGCGGTTC 

CTGTTGTCAA 

AT C C AAC TAT 

CTTCAGTATC 

CGGTTGAACC 

GATCAACTGG 

3300 

TCAGAGTTTC 

TACCCCCGCC 

GCCAGAACAC 

CCACCTCCGT 

CTTCTACCTA 

TGGATACGCA 

3360 

CAAGGATCTC 

CTGAATCTTC 

GCGGAAGAGC 

TCCAAAAGCG 

CAGGTTCCGG 

CATTTCTACA 

3420 

AATCAAAGCA 

TTCTGAACGC 

ATCCATACAC 

AGCAGCTCCT 

CGGGCGGCTT 

TTCAGCTTGG 

3480 

GGAGTATCGC 

CCCAATATGC 

TGTCGCCTGT 

CCACCGGAAA 

ACGTTTATAG 

CAATCCGCTG 

3540 

TCGGCAGTGG 

CTGGCGGCAC 

CCAGAACCGC 

TATCAGATAA 

CGCCCACAAA 

CCAACATCCG 

3600 

CCACAGTTAC 

CGGCCTACTT 

TGCCACCACG 

GGTCCAGGAG 

GAGCTGTACC 

ACCCAACCAC 

3660 

CTGCCATTTG 

CCACACAGCG 

TCATGCAGCC 

AGCGAGTACC 

AGGCTGGACT 

GAATGCAGCG 

3720 

CGATGTGCCC 

AAAGCCGCGC 

CTGCAACAGC 

TGCGATGCCT 

TGGCCACACC 

CTCGCCCATG 

3780 

CAACCCCCAC 

CGCCAGTTCC 

CGTACCCGAG 

GGCTGGTACC 

AACCGGTGCA 

TCCCAATAGC 

3840 

CACCCGATGC 

ACCCGACCTC 

CTCCAACCAC 

CAGATCTACC 

AGTGCTCCTC 

CGAGTGCTCG 

3900 

GATCACTCGA 

GGAGCTCGCA 

GAGTCACAAG 

CGGCAGCTGC 

AGCTCGAGGA 

GCACGGCAGC 

3960 

AGTGCCAAAC 

AACGCGGAGG 

ACACCACCGT 

CGACGAGCCC 

CGGTGGTGCA 

GCCGTGCATG 

4020 

G AG AG CG AG A 

ACGAGAACAT 

GCTGGCGGAG 

TACGAGCAGC 

GCCAGTACAC 

CAGCGATTGC 

4080 

TGCAATAGCT 

CCCGCGAGGG 

CGACACCTGC 

TCCTGCAGCG 

AGGGATCCTG 

TCTTTACGCC 

4140 

GAGGCGGGCG 

AGCCGGCGCC 

TCGTCAAATG 

ACTGCTAAGA 

ACACCTAA 


4188 


(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 13 95 amino acids. 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met His Pro Met His Pro Glu Asn His Ala lie Ala Arg Ser Thr Ser 

15 10 15 

Thr Thr Asn Asn Pro Ser Arg Ser Arg Ser Ser Arg Met Trp Leu Leu 

20 25 30 

Pro Ala Trp Leu Leu Leu Val Leu Val Ala Ser Asn Gly Leu Pro Ala 

35 40 45 

Val Arg Gly Gin Tyr Gin Ser Pro Arg lie lie Glu His Pro Thr Asp 

50 55 60 

Leu Val Val Lys Lys Asn Glu Pro Ala Thr Leu Asn Cys Lys Val Glu 
65 70 75 80 

Gly Lys Pro Glu Pro Thr lie Glu Trp Phe Lys Asp Gly Glu Pro Val 

85 90 95 

Ser Thr Asn Glu Lys Lys Ser His Arg Val Gin Phe Lys Asp Gly Ala 

100 105 110 

Leu Phe Phe Tyr Arg Thr Met Gin Gly Lys Lys Glu Gin Asp Gly Gly 

115 120 125 

Glu Tyr Trp Cys Val Ala Lys Asn Arg Val Gly Gin Ala Val Ser Arg 

130 135 140 

His Ala Ser Leu Gin lie Ala Val Leu Arg Asp Asp Phe Arg Val Glu 
145 150 155 160 

Pro Lys Asp Thr Arg Val Ala Lys Gly Glu Thr Ala Leu Leu Glu Cys 

165 170 175 

Gly Pro Pro Lys Gly lie Pro Glu Pro Thr Leu lie Trp lie Lys Asp 

180 185 190 

Gly Val Pro Leu Asp Asp Leu Lys Ala Met Ser Phe Gly Ala Ser Ser 

195 200 205 

Arg Val Arg lie Val Asp Gly Gly Asn Leu Leu lie Ser Asn Val Glu 

210 215 220 

Pro lie Asp Glu Gly Asn Tyr Lys Cys lie Ala Gin Asn Leu Val Gly 
225 230 235 240 

Thr Arg Glu Ser Ser Tyr Ala Lys Leu lie Val Gin Val. Lys Pro Tyr 
245 250 255 
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Phe Met Lys Glu Pro Lys Asp Gin Val Met Leu Tyr Gly Gin Thr Ala 

260 265 270 

Thr Phe His Cys Ser Val Gly Gly Asp Pro Pro Pro Lys Val Leu Trp 

275 280 285 

Lys Lys Glu Glu Gly Asn lie Pro Val Ser Arg Ala Arg lie Leu His 

290 295 300 

Asp Glu Lys Ser Leu Glu lie Ser Asn lie Thr Pro Thr Asp Glu Gly 
305 310 315 320 

Thr Tyr Val Cys Glu Ala His Asn Asn Val Gly Gin lie Ser Ala Arg 

325 330 335 

Ala Ser Leu lie Val His Ala Pro Pro Asn Phe Thr Lys Arg Pro Ser 

340 345 350 

Asn Lys Lys Val Gly Leu Asn Gly Val Val Gin Leu Pro Cys Met Ala 

355 360 365 

Ser Gly Asn Pro Pro Pro Ser Val Phe Trp Thr Lys Glu Gly Val Ser 

370 375 380 " ' 

Thr Leu Met Phe Pro Asn Ser Ser His Gly Arg Gin Tyr Val Ala Ala 
385 390 395 400 

Asp Gly Thr Leu Gin lie Thr Asp Val Arg Gin Glu Asp Glu Gly Tyr 

405 410 415 

Tyr Val Cys Ser Ala Phe Ser Val Val Asp Ser Ser Thr Val Arg Val 

420 425 430 

Phe Leu Gin Val Ser Ser Val Asp Glu Arg Pro Pro Pro lie lie Gin 

435 440 445 

lie Gly Pro Ala Asn Gin Thr Leu Pro Lys Gly Ser Val Ala Thr Leu 

450 455 460 

Pro Cys Arg Ala Thr Gly Asn Pro Ser Pro Arg lie Lys Trp Phe His 
465 470 475 480 

Asp Gly His Ala Val Gin Ala Gly Asn Arg Tyr Ser lie lie Gin Gly 

485 490 495 

Ser Ser Leu Arg Val Asp Asp Leu Gin Leu Ser Asp Ser Gly Thr Tyr 

500 505 510 

Thr Cys Thr Ala Ser Gly Glu Arg ■ Gly Glu Thr Ser Trp Ala Ala Thr 

515 520 525 

Leu Thr Val Glu Lys Pro Gly Ser Thr Ser Leu His Arg Ala Ala Asp 

530 535 540 

Pro Ser Thr Tyr Pro Ala Pro Pro Gly Thr Pro Lys Val Leu Asn Val 
545 550 555 560 
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Ser Arg Thr Ser lie 
565 

Gly Ala Val Gly Pro 
580 

Asp Leu Gin Thr Gly 
595 

Gin Val Thr lie Ser 
610 

Val Arg Ala Glu Asn 
625 

Asn Val lie Lys Thr 
645 

Asp Leu Ser Ala Ala 
660 

lie Asp Ala Ser Ala 
675 

Leu His Val Ser Ala 
690 

Tyr Lys Asp Ala Ser 
705 

Met Asp Ala Ser Ala 
725 

Thr Lys Tyr Glu Phe 
740 

Gin Pro Ser Asn Ser 
755 

Ala Pro Pro Asp Asn 
770 

Trp Val Arg Trp Thr 
785 

Tyr Gly Tyr Lys lie 
805 

Ala Asn Met Thr Leu 
820 

Leu Thr Thr Gly Ala 
835 

Ala Gly Asp Gly Pro 
850 


Ser Leu Arg Trp Ala 
570 

lie lie Gly Tyr Thr 
585 

Trp lie Val Ala Ala 
600 

Gly Leu Thr Pro Gly 
615 

Thr Gin Gly lie Ser 
630 

lie Glu Ala Asp Phe 
650 

Arg Thr Leu Leu Thr 
665 

lie Asn Ala Ser Ala 
680 

Asp Glu Lys Tyr Val 
695 

Val Pro Ser Ala Gin 
710 

Glu Ser Phe Val Val 
730 

Phe Leu Thr Pro Phe 
745 

Lys Thr Ala Leu Thr 
760 

lie Gin lie Gly Met 
775 

Pro Pro Pro Ser Gin 
790 

Glu Val Ser Ala Gly 
810 

Asn Ala Thr Thr Thr 
825 

Val Tyr Ser Val Arg 
840 

Tyr Ser Lys Pro lie 
855 
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Lys Ser Gin Glu Lys 
575 

Val Glu Tyr Phe Ser 
590 

His Arg Val Gly Asp 
605 

Thr Ser Tyr Val Phe 
620 

Val Pro Ser Gly Leu 
635 

Asp Ala Ala Ser Ala 
655 

Gly Lys Ser Val Glu 
670 

Val Arg Leu Glu Trp 
685 

Glu Gly Leu Arg lie 
700 

Tyr His Ser He Thr 
715 

Gly Asn Leu Lys Lys 
735 

Phe Glu Thr He Glu 
750 

Tyr Glu Asp Val Pro 
765 

Tyr Asn Gin Thr Ala 
780 

His His Asn Gly Asn 
795 

Asn Thr Met Lys Val 
815 

Ser Val Leu Leu Asn 
830 

Leu Asn Ser Phe Thr 
845 

Ser Leu Phe Met Asp 
860 


Pro 

Pro 

Thr 

Leu 

Ser 
640 
Asn 

Leu 

Met 

His 

Val 
720 
Tyr 

Gly 

Ser 

Gly 

Leu 
800 
Leu 

Asn 

Lys 

Pro 

B98-006 


Thr His His Val His Pro Pro Arg Ala His Pro Ser Gly Thr His Asp 
865 870 875 880 

Gly Arg His Glu Gly Gin Asp Leu Thr Tyr His Asn Asn Gly Asn lie 
885 890 895 

Pro Pro Gly Asp lie Asn Pro Thr Thr His Lys Lys Thr Thr Asp Tyr 

900 905 910 

Leu Ser Gly Pro Trp Leu Met Val Leu Val Cys lie Val Leu Leu Val 

915 920 925 

Leu Val lie Ser Ala Ala lie Ser Met Val Tyr Phe Lys Arg Lys His 

930 935 940 

Gin Met Thr Lys Glu Leu Gly His Leu Ser Val Val Ser Asp Asn Glu 
945 950 955 960 

lie Thr Ala Leu Asn lie Asn Ser Lys Glu Ser Leu Trp lie Asp His 

965 970 975 

His Arg Gly Trp Arg Thr Ala Asp Thr Asp Lys Asp Ser Gly Leu Ser 

980 985 990 

Glu Ser Lys Leu Leu Ser His Val Asn Ser Ser Gin Ser Asn Tyr Asn 

995 1000 1005 

Asn Ser Asp Gly Gly Thr Asp Tyr Ala Glu Val Asp Thr Arg Asn Leu 

1010 1015 1020 

Thr Thr Phe Tyr Asn Cys Arg Lys Ser Pro Asp Asn Pro Thr Pro Tyr 
1025 1030 1035 1040 

Ala Thr Thr Met lie lie Gly Thr Ser Ser Ser Glu Thr Cys Thr Lys 

1045 1050 1055 

Thr Thr Ser lie Ser Ala Asp Lys Asp Ser Gly Thr His Ser Pro Tyr 

1060 1065 1070 

Ser Asp Ala Phe Ala Gly Gin Val Pro Ala Val Pro Val Val Lys Ser 

1075 1080 1085 

Asn Tyr Leu Gin Tyr Pro Val Glu Pro lie Asn Trp Ser Glu Phe Leu 

1090 1095 1100 

Pro Pro Pro Pro Glu His Pro Pro Pro Ser Ser Thr Tyr Gly Tyr Ala 
1105 1110 1115 1120 

Gin Gly Ser Pro Glu Ser Ser Arg Lys Ser Ser Lys Ser Ala Gly Ser 

1125 1130 1135 

Gly lie Ser Thr Asn Gin Ser lie Leu Asn Ala Ser lie His Ser Ser 

1140 1145 1150 

Ser Ser Gly Gly Phe Ser Ala Trp Gly Val Ser Pro Gin Tyr Ala Val 

1155 1160 1165 


B9S-006 


Ala Cys Pro Pro Glu Asn Val Tyr Ser Asn Pro Leu Ser Ala Val Ala 

1170 1175 1180 

Gly Gly Thr Gin Asn Arg Tyr Gin lie Thr Pro Thr Asn Gin His Pro 
1185 1190 1195 1200 

Pro Gin Leu Pro Ala Tyr Phe Ala Thr Thr Gly Pro Gly Gly Ala Val 

1205 1210 1215 

Pro Pro Asn His Leu Pro Phe Ala Thr Gin Arg His Ala Ala Ser Glu 

1220 1225 1230 

Tyr Gin Ala Gly Leu Asn Ala Ala Arg Cys Ala Gin Ser Arg Ala Cys 

1235 1240 1245 

Asn Ser Cys Asp Ala Leu Ala Thr Pro Ser Pro Met Gin Pro Pro Pro 

1250 1255 1260 

Pro Val Pro Val Pro Glu Gly Trp Tyr Gin Pro Val His Pro Asn Ser 
1265 1270 1275 1280 

His Pro Met His Pro Thr Ser Ser Asn His Gin lie Tyr Gin Cys Ser 

1285 1290 1295 

Ser Glu Cys Ser Asp His Ser Arg Ser Ser Gin Ser His Lys Arg Gin 

1300 1305 1310 

Leu Gin Leu Glu Glu His Gly Ser Ser Ala Lys Gin Arg Gly Gly His 

1315 1320 1325 

His Arg Arg Arg Ala Pro Val Val Gin Pro Cys Met Glu Ser Glu Asn 

1330 1335 1340 

Glu Asn Met Leu Ala Glu Tyr Glu Gin Arg Gin Tyr Thr Ser Asp Cys 
1345 1350 1355 1360 

Cys Asn Ser Ser Arg Glu Gly Asp Thr Cys Ser Cys Ser Glu Gly Ser 

1365 1370 1375 

Cys Leu Tyr Ala Glu Ala Gly Glu Pro Ala Pro Arg Gin Met Thr Ala 
1380 1385 1390 

Lys Asn Thr 
1395 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE : CDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 


GGTGAAAATC 

CACGCATCAT 

CGAGCATCCC 

ATGGACACGA 

CGGTGCCAAA 

AAATG AT C C A 

60 

TTTACGTTTA 

ATTGCCAGGC 

CGAGGGCAAT 

CCAACACCAA 

CCATTCAATG 

GTTTAAGGAC 

120 

GGTCGCGAAC 

TGAAGACGGA 

TACGGGTTCG 

CATCGCATAA 

TGCTGCCCGC 

CGGGGGTCTA 

180 

TTCTTTCTCA 

AGGTTATCCA 

CTCACGTAGA 

GAG AG CG ATG 

CGGGCACTTA 

CTGGTGCGAG 

240 

GCCAAAAACG 

AGTTTGGAGT 

GGCACGGTCC 

AGGAATGCAA 

CGTTGCAAGT 

GGCAGTTCTC 

300 

CGCGACGAAT 

TCCGTTTGGA 

GCCGGCAAAT 

ACCCGCGTGG 

CCCAAGGCGA 

GGTGGCCCTG 

360 

ATGGAATGCG 

GTGCCCCCCG 

AGGATCTCCG 

GAGCCGCAAA 

TCTCGTGGCG 

CAAGAACGGC 

420 

CAGACCCTGA 

ATCTTGTCGG 

GAACAAGCGG 

ATTCGCATTG 

TCGACGGTGG 

CAATCTGGCC 

480 

ATCCAGGAAG 

CCCGCCAATC 

GGACGACGGA 

CGCTACCAGT 

GTGTGGTCAA 

GAATGTGGTT 

540 

GGCACCCGGG 

AGTCGGCCAC 

CGCTTTTCTT 

AAAGTGCATG 

TACGTCCATT 

CCTCATCCGA 

600 

GGACCCCAGA 

ATCAGACGGC 

GGTGGTGGGC 

AGCTCGGTGG 

TCTTCCAGTG 

CCGCATCGGA 

660 

GGCGATCCCC 

TGCCTGATGT 

CCTGTGGCGA 

CGCACTGCCT 

CCGGCGGCAA 

TATGCCACTG 

720 

CGTAAGTTTT 

CTTGGCTTCA 

TTCAG CTTC A 

GGTCGTGTGC 

ACGTACTTGA 

GGACCGCAGT 

780 

CTGAAGCTGG 

ACGACGTTAC 

TCTGGAGGAC 

ATGGGCGAGT 

ACACTTGCGA 

GGCGGACAAT 

840 

GCGGTGGGCG 

GCATCACGGC 

CACTGGCATC 

CTCACCGTTC 

ACGCTCCCCC 

CAAATTTGTG 

900 

ATACGCCCCA 

AGAATCAGCT 

GGTGGAGATC 

GGTGATGAAG 

TGCTGTTCGA 

GTGCCAAGCG 

960 

AATGGACATC 

CCCGACCAAC 

GCTCTACTGG 

TCGGTGGAGG 

GCAACAGCTC 

CCTGCTGCTC 

1020 

CCCGGCTATC 

GGG ATGGCCG 

CATGGAAGTG 

ACCCTGACGC 

CCGAGGGGCG 

CTCGGTGCTC 

1080 

TCGATAGCTC 

GATTTGCCCG 

TGAGGATTCC 

GGAAAGGTGG 

TCACTTGCAA 

CGCCCTGAAC 

1140 

GCCGTGGGCA 

GCGTCAGCAG 

TCGGACTGTG 

GTCAGTGTGG 

ATACGCAATT 

CGAGCTGCCA 

1200 

CCGCCGATTA 

TCGAACAGGG 

GCCCGTGAAT 

CAAACGTTGC 

CCGTTAAATC 

AATTGTGGTT 

1260 

CTGCCATGCC 

GAACTCTGGG 

CACTCCAGTG 

CCACAGGTCT 

CTTGGTACCT 

GG ATGG CAT A 

1320 

CCCATCGATG 

TGCAGGAGCA 

CGAGCGGCGG 

AATCTTTCGG 

ACGCTGGAGC 

CTTAACCATT 

1380 

TCGGATCTTC 

AGCGCCACGA 

GGATGAAGGC 

TTGTACACCT 

GCGTGGCCAG 

CAATCGCAAC 

1440 

GGAAAATCCT 

CTTGGAGTGG 

TTACCTTCGT 

CTGGACACCC 

CGACAAATCC 

GAATATCAAG 

1500 

TTCTTCAGAG 

CCCCAGAACT 

TTCCACCTAC 

CCAGGGCCGC 

CAGGAAAACC 

GCAAATGGTG 

1560 

GAGAAGGGCG 

AAAATTCGGT 

GACTCTCAGC 

TGGACGAGGA 

GCAACAAGGT 

GGGCGGCTCC 

1620 

AGTCTGGTGG 

GCTATGTAAT 

CG AG AT GTTT 

GGCAAAAACG 

AAACGGATGG 

CTGGGTGGCT 

1680 

GTGGGCACTA 

GGGTGCAAAA 

TACCACGTTT 

ACCCAAACGG 

GTCTGCTGCC 

GGGTGTGAAT 

1740 

TACTTCTTTC 

TAATTCGAGC 

CGAGAACTCC 

CATGGCTTAT 

CACTGCCCAG 

TCCGATGTCG 

1800 

G AAC C C ATT A 

CGGTGGGAAC 

GCGCTACTTC 

AATAGTGGTC 

TGGATCTGAG 

CGAGGCTCGT 

1860 

GCCAGTCTGC 

TGTCCGGAGA 

TGTTGTGGAG 

CTGAGCAACG 

CCAGTGTGGT 

GGACTCCACT 

1920 

AGCATGAAAC 

TCACCTGGCA 

GATCATCAAT 

GGCAAATACG 

TCGAGGGCTT 

CTATGTCTAT 

1980 

GCGAGACAGT 

TGCCAAATCC 

AATAGTCAAC 

AATCCGGCGC 

CCGTTACTAG 

CAATACCAAT 

2040 

CCGCTGCTGG 

GCTCTACATC 

CACATCCGCA 

TCCGCATCCG 

CCTCGGCATC 

GGCATTGATT 

2100 

TCGACAAAGC 

CAAATATTGC 

AGCTGCCGGC 

AAACGTGATG 

GGGAGACAAA 

CCAGAGTGGA 

2160 

GGAGGAGCTC 

CGACCCCACT 

GAACACCAAG 

TATCGCATGC 

TAACGATTCT 

CAATGGCGGT 

2220 
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GGCGCCTCAT 

CCTGCACCAT 

CACCGGGCTC 

GTCCAGTACA 

CGCTGTATGA 

ATTTTTCATC 

2280 

GTGCCATTTT 

ACAAATCCGT 

CGAGGGCAAG 

CCGTCGAATT 

CGCGCATCGC 

TCGCACCCTT 

2340 

GAAGATGTTC 

CCTCTGAGGC 

ACCATATGGA 

ATGGAGGCTC 

TGCTGTTGAA 

CTCCTCCGCG 

2400 

GTCTTCCTCA 

AATGGAAGGC 

ACCAGAACTC 

AAGGATCGGC 

ATGGTGTTCT 

CTTGAACTAT 

2460 

CATGTTATAG 

TCCGAGGTAT 

TGACACTGCC 

CACAATTTCT 

CACGCATTTT 

GACAAATGTC 

2520 

AC CATCG ATG 

CCGCTTCGCC 

TACTCTGGTT 

TTGGCCAATC 

TCACCGAAGG 

CGTCATGTAC 

2580 

ACCGTGGGCG 

TGGCGGCCGG 

AAATAACGCT 

GGAGTTGGTC 

CTTATTGTGT 

CCCAGCTACT 

2640 

TTGCGTTTGG 

ATCCCATCAC 

AAAGCGACTC 

GATCCGTTCA 

TCAATCAGCG 

GGACCATGTT 

2700 

AACGATGTGC 

TGACGCAGCC 

CTGGTTCATA 

ATACTCCTGG 

GCGCCATCCT 

GGCCGTTCTT 

2760 

ATGCTGTCCT 

TTGGCGCAAT 

GGTCTTTGTG 

AAGCGCAAGC 

ACATGATGAT 

GAAGCAGTCG 

2820 

GCCCTAAATA 

CAATGCGTGG 

CAATCACACG 

AGCGACGTGC 

TCAAAATGCC 

GAGTCTATCG 

2880 

GCGCGCAATG 

GAAACGGCTA 

CTGGCTGGAC 

TCCTCCACCG 

GCGGAATGGT 

GTGGCGTCCC 

2940 

TCGCCCGGCG 

GCGACTCGCT 

GGAGATGCAA 

AAGGATCACA 

TCGCCGACTA 

TGCGCCGGTC 

3000 

TGCGGTGCCC 

CCGGTTCTCC 

GGCCGGCGGT 

GGCACCTCTT 

CCGGTGGATC 

CGGTGGCGCG 

3060 

GGCAGCGGTG 

CCAGCGGCGG 

CGATGACATT 

CATGGAGGAC 

ACGGCAGCGA 

ACGCAATCAG 

3120 

CAGCGGTACG 

TGGGCGAGTA 

CTCCAACATA 

CCGACCGACT 

ATGCAGAGGT 

GTCCAGTTTT 

3180 

GGCAAGGCAC 

CCAGCGAGTA 

TGGTCGGCAT 

GGCAACGCCT 

CCCCGGCCCC 

TTATGCCACC ' 

3240 

TCTTCGATCC 

TGAGTCCCCA 

CC AG'CAGCAA\ 

CAGCAGCAGC 

_AGCCGCGTTA 

TCAACAGCGA 

3300 

CCAGTGCCCG 

GCTATGGGCT 

CCAGCGCCCA 

ATGCACCCAC 

ACTACCAGCA 

GCAGCAGCAT 

3360 

CAGCAGCAAC 

AGGCGCAGCA 

GACGCACCAG 

CAACACCAGG 

CTCTCCAGCA 

GCACCAGCAA 

3420 

CTGCCACCCA 

GCAACATCTA 

CCAGCAGATG 

TCCACCACCA 

GCGAGATATA 

CCCCACGAAC 

3480 

ACGGGTCCTT 

CGCGCTCTGT 

CTACTCTGAG 

CAGTATTACT 

ACCCCAAGGA 

CAAGCAGAGA 

3540 

CACATCCACA 

TCACCGAGAA 

CAAGCTGAGC 

AACTGCCACA 

CCTATGAGGC 

GGCTCCTGGC 

3600 

GCCAAGCAGT 

CCTCGCCGAT 

ATCCTCGCAG 

TTCGCCAGCG 

TGAGGCGGCA 

GCAGCTGCCG 

3660 

CCCAACTGCA 

GCATCGGCAG 

GGAAAGTGCC 

CGCTTCAAGG 

TGCTAAACAC 

GGATCAGGGC 

3720 

AAGAACCAGC 

AGAATCTCCT 

GGATCTCGAC 

GGCTCCTCGA 

TGTGCTACAA 

CGGTCTGGCA 

3780 

GACTCGGGCT 

GCGGTGGATC 

TCCCTCCCCG 

ATGGCCATGC 

TGATGTCGCA 

CGAGGACGAG 

3840 

CACGCGCTGT 

ACCACACGGC 

GGATGGGGAT 

CTGGACGACA 

TGGAACGACT 

GTACGTCAAG 

3900 

GTGGACGAGC 

AGCAGCCTCC 

ACAGCAGCAG 

CAGCAGCTGA 

TTCCCCTGGT 

CCCACAGCAT 

3960 

CCGGCGGAAG 

GTCACCTGCA 

GTCCTGGCGG 

AATC AGAG C A 

CGCGGAGCAG 

TCGGAAGAAC 

4020 

GGCCAGGAAT 

GCATCAAGGA 

AC C C AG CG AG 

TTGATCTACG 

CTCCGGGAAG 

CGTGGCCAGC 

4080 

GAACGGAGCC 

TCCTCAGCAA 

CTCGGGTAGC 

GGCACCAGCA 

GCCAGCCAGC 

TGGCCACAAT 

4140 

GTCTGA 






4146 


(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 1381 amino acids 

(B) TYPE: amino acid 


46 


B98-006 


(C) STRANDEDNESS : single 

(D) TOPOLOGY : linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 

Gly Glu Asn Pro Arg lie lie Glu His Pro Met Asp Thr Thr Val Pro 

15 10 15 

Lys Asn Asp Pro Phe Thr Phe Asn Cys Gin Ala Glu Gly Asn Pro Thr 

20 25 30 

Pro Thr lie Gin Trp Phe Lys Asp Gly Arg Glu Leu Lys Thr Asp Thr 

3 5" 4 0 4 5 

Gly Ser His Arg lie Met Leu Pro Ala Gly Gly Leu Phe Phe Leu Lys 

50 55 60 

Val lie His Ser Arg Arg Glu Ser Asp Ala Gly Thr Tyr Trp Cys Glu 
65 70 75 80 

Ala Lys Asn Glu Phe Gly Val Ala Arg Ser Arg Asn Ala Thr Leu Gin 

85 90 95 

Val Ala Val Leu Arg Asp Glu Phe Arg Leu Glu Pro Ala Asn Thr Arg 

100 105 110 

Val Ala Gin Gly Glu Val Ala Leu Met Glu Cys Gly Ala Pro Arg Gly 

115 120 125 

Ser Pro Glu Pro Gin lie Ser Trp Arg Lys Asn Gly Gin Thr Leu Asn 

130 135 140 

Leu Val Gly Asn Lys Arg lie Arg lie Val Asp Gly Gly Asn Leu Ala 
145 150 155 160 

lie Gin Glu Ala Arg Gin Ser Asp Asp Gly Arg Tyr Gin Cys Val Val 

165 170 175 

Lys Asn Val Val Gly Thr Arg Glu Ser Ala Thr Ala Phe Leu Lys Val 

180 185 190 

His Val Arg Pro Phe Leu lie Arg Gly Pro Gin Asn Gin Thr Ala Val 

195 200 205 

Val Gly Ser Ser Val Val Phe Gin Cys Arg lie Gly Gly Asp Pro Leu 

210 215 220 

Pro Asp Val Leu Trp Arg Arg Thr Ala Ser Gly Gly Asn Met Pro Leu 
225 230 235 240 

Arg Lys Phe Ser Trp Leu His Ser Ala Ser Gly Arg Val His Val Leu 

245 250 255 

Glu Asp Arg Ser Leu Lys Leu Asp Asp Val Thr Leu Glu Asp Met Gly 
260 265 270 
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Glu Tyr Thr Cys Glu 
275 

Gly lie Leu Thr Val 
290 

Asn Gin Leu Val Glu 
305 

Asn Gly His Pro Arg 
325 

Ser Leu Leu Leu Pro 
340 

Thr Pro Glu Gly Arg 
355 

Asp Ser Gly Lys Val 
370 

Val Ser Ser Arg Thr 
385 

Pro Pro lie lie Glu 
405 

Ser lie Val Val Leu 
420 

Val Ser Trp Tyr Leu 
435 

Arg Arg Asn Leu Ser 
450 

Arg His Glu Asp Glu 
465 

Gly Lys Ser Ser Trp 
485 

Pro Asn lie Lys Phe 
500 

Pro Pro Gly Lys Pro 
515 

Leu Ser Trp Thr Arg 
530 

Tyr Val lie Glu' Met 
545 

Val Gly Thr Arg Val 
565 


Ala Asp Asn Ala Val 
280 

His Ala Pro Pro Lys 
295 

lie Gly Asp Glu Val 
310 

Pro Thr Leu Tyr Trp 
330 

Gly Tyr Arg Asp Gly 
345 

Ser Val Leu Ser lie 
360 

Val Thr Cys Asn Ala 
375 

Val Val Ser Val Asp 
3 90 

Gin Gly Pro Val Asn 
410 

Pro Cys Arg Thr Leu 
425 

Asp Gly lie Pro lie 
440 

Asp Ala Gly Ala Leu 
455 

Gly Leu Tyr Thr Cys 
470 

Ser Gly Tyr Leu Arg 
490 

Phe Arg Ala Pro Glu 
505 

Gin Met Val Glu Lys 
520 

Ser Asn Lys Val Gly 
535 

Phe Gly Lys Asn Glu 
550 

Gin Asn Thr Thr Phe 

570 
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Gly Gly lie Thr Ala 
285 

Phe Val lie Arg Pro 
300 

Leu Phe Glu Cys Gin 
315 

Ser Val Glu Gly Asn 
335 

Arg Met Glu Val Thr 
350 

Ala Arg Phe Ala Arg 
365 

Leu Asn Ala Val Gly 
380 

Thr Gin Phe Glu Leu 

395 

Gin Thr Leu Pro Val 
415 

Gly Thr Pro Val Pro 
430 

Asp Val Gin Glu His 
445 

Thr lie Ser Asp Leu 
460 

Val Ala Ser Asn Arg 
475 

Leu Asp Thr Pro Thr 
495 

Leu Ser Thr Tyr Pro 
510 

Gly Glu Asn Ser Val 
525 

Gly Ser Ser Leu Val 
540 

Thr Asp Gly Trp Val 
555 

Thr Gin Thr Gly Leu 
575 


Thr 

Lys 

Ala 
320 
Ser 

Leu 

Glu 

Ser 

Pro 
400 
Lys 

Gin 

Glu 

Gin 

Asn 
480 
Asn 

Gly 

Thr 

Gly 

Ala 
560 
Leu 
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Pro Gly Val Asn Tyr 
580 

Leu Ser Leu Pro Ser 
595 

Tyr Phe Asn Ser Gly 
610 

Ser Gly Asp Val Val 
625 

Ser Met Lys Leu Thr 
645 

Phe Tyr Val Tyr Ala 
660 

Ala Pro Val Thr Ser 
675 

Ser Ala Ser Ala Ser 
690 

Asn lie Ala Ala Ala 
705 

Gly Gly Ala Pro Thr 
725 

Leu Asn Gly Gly Gly 
740 

Tyr Thr Leu Tyr Glu 
755 

Gly Lys Pro Ser Asn 
770 

Ser Glu Ala Pro Tyr 
785 

Val Phe Leu Lys Trp 
805 

Leu Leu Asn Tyr His 
820 

Phe Ser Arg lie Leu 
835 

Leu Val Leu Ala Asn 
850 

Ala Ala Gly Asn Asn 
865 


Phe Phe Leu lie Arg 
585 

Pro Met Ser Glu Pro 
600 

Leu Asp Leu Ser Glu 
615 

Glu Leu Ser Asn Ala 
630 

Trp Gin He He Asn 
650 

Arg Gin Leu Pro Asn 
665 

Asn Thr Asn Pro Leu 
680 

Ala Ser Ala Ser Ala 
695 

Gly Lys Arg Asp Gly 
710 

Pro Leu Asn Thr Lys 
730 

Ala Ser Ser Cys Thr 
745 

Phe Phe He Val Pro 
760 

Ser Arg He Ala Arg 
775 

Gly Met Glu Ala Leu 
790 

Lys Ala Pro Glu Leu 
810 

Val lie Val Arg Gly 
825 

Thr Asn Val Thr lie 
840 

Leu Thr Glu Gly Val 
855 

Ala Gly Val Gly Pro 
870 
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Ala Glu Asn Ser His 
590 

He Thr Val Gly Thr 
605 

Ala Arg Ala Ser Leu 
620 

Ser Val Val Asp Ser 
635 

Gly Lys Tyr Val Glu 
655 

Pro He Val Asn Asn 
670 

Leu Gly Ser Thr Ser 
685 

Leu He Ser Thr Lys 
700 

Glu Thr Asn Gin Ser 
715 

Tyr Arg Met Leu Thr 
735 

He Thr Gly Leu Val 
750 

Phe Tyr Lys Ser Val 
765 

Thr Leu Glu Asp Val 
780 

Leu Leu Asn Ser Ser 
795 

Lys Asp Arg His Gly 
815 

He Asp Thr Ala His 
830 

Asp Ala Ala Ser Pro 
845 

Met Tyr Thr Val Gly 
860 

Tyr Cys Val Pro Ala 
875 


Gly 

Arg 

Leu 

Thr 
640 
Gly 

Pro 

Thr 

Pro 

Gly 
720 
He 

Gin 

Glu 

Pro 

Ala 
800 
Val 

Asn 

Thr 

Val 

Thr 
880 
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Leu Arg Leu Asp Pro lie Thr Lys Arg Leu Asp Pro Phe lie Asn Gin 

885 890 895 

Arg Asp His Val Asn Asp Val Leu Thr Gin Pro Trp Phe lie lie Leu 

900 905 910 

Leu Gly Ala lie Leu Ala Val Leu Met Leu Ser Phe Gly Ala Met Val 

915 920 925 

Phe Val Lys Arg Lys His Met Met Met Lys Gin Ser Ala Leu Asn Thr 

930 935 940 

Met Arg Gly Asn His Thr Ser Asp Val Leu Lys Met Pro Ser Leu Ser 
945 950 955 960 

Ala Arg Asn Gly Asn Gly Tyr Trp Leu Asp Ser Ser Thr Gly Gly Met 

965 970 975 

Val Trp Arg Pro Ser Pro Gly Gly Asp Ser . Leu Glu Met Gin Lys Asp 

980 985 990 

His lie Ala Asp Tyr Ala Pro Val Cys Gly Ala Pro Gly Ser Pro Ala 

995 1000 1005 

Gly Gly Gly Thr Ser Ser Gly Gly Ser Gly Gly Ala Gly Ser Gly Ala 

1010 1015 1020 

Ser Gly Gly Asp Asp lie His Gly Gly His Gly Ser Glu Arg Asn Gin 
1025 1030 1035 1040 

Gin Arg Tyr Val Gly Glu Tyr Ser Asn lie Pro Thr Asp Tyr Ala Glu 

1045 1050 1055 

Val Ser Ser Phe Gly Lys Ala Pro Ser Glu Tyr Gly Arg His Gly Asn 

1060 1065 1070 

Ala Ser Pro Ala Pro Tyr Ala Thr Ser Ser lie Leu Ser Pro His Gin 

1075 1080 1085 

Gin Gin Gin Gin Gin Gin Pro Arg Tyr Gin Gin Arg Pro Val Pro Gly 

1090 1095 1100 

Tyr Gly Leu Gin Arg Pro Met His Pro His Tyr Gin Gin Gin Gin His 
1105 1110 1115 1120 

Gin Gin Gin Gin Ala Gin Gin Thr His Gin Gin His Gin Ala Leu Gin 

1125 1130 1135 

Gin His Gin Gin Leu Pro Pro Ser Asn lie Tyr Gin Gin Met Ser Thr 

1140 1145 1150 

Thr Ser Glu lie Tyr Pro Thr Asn Thr Gly Pro Ser Arg Ser Val Tyr 

1155 1160 1165 

Ser Glu Gin Tyr Tyr Tyr Pro Lys Asp Lys Gin Arg His lie His lie 
1170 1175 1180 
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Thr Glu Asn Lys Leu Ser Asn Cys His Thr Tyr Glu Ala Ala Pro Gly 
1185 1190 1195 1200 

Ala Lys Gin Ser Ser Pro lie Ser Ser Gin Phe Ala Ser Val Arg Arg 

1205 1210 1215 

Gin Gin Leu Pro Pro Asn Cys Ser lie Gly Arg Glu Ser Ala Arg Phe 

1220 1225 1230 

Lys Val Leu Asn Thr Asp Gin Gly Lys Asn Gin Gin Asn Leu Leu Asp 

1235 1240 1245 

Leu Asp Gly Ser Ser Met Cys Tyr Asn Gly Leu Ala Asp Ser Gly Cys 

1250 1255 1260 

Gly Gly Ser Pro Ser Pro Met Ala Met Leu Met Ser His Glu Asp Glu 
1265 1270 1275 1280 

His Ala Leu Tyr His Thr Ala Asp Gly Asp Leu Asp Asp Met Glu Arg 

1285 1290 1295 

Leu Tyr Val Lys Val Asp Glu Gin Gin Pro Pro Gin Gin Gin Gin Gin 

1300 1305 1310 

Leu lie Pro Leu Val Pro Gin His Pro Ala Glu Gly His Leu Gin Ser 

1315 1320 1325 

Trp Arg Asn Gin Ser Thr Arg Ser Ser Arg Lys Asn Gly Gin Glu Cys 

1330 1335 1340 

lie Lys Glu Pro Ser Glu Leu lie Tyr Ala Pro Gly Ser Val Ala Ser 
1345 1350 1355 1360 

Glu Arg Ser Leu Leu Ser Asn Ser Gly Ser Gly Thr Ser Ser Gin Pro 

1365 1370 1375 

Ala Gly His Asn Val 
1380 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3894 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO : 5 : 
ATGTACTATC TAGGTTTTTA CCACACTCAC ACACACACAC ACACATACAT AAATTTTGAT 6 0 

AAAATTCCTA ATGCCTCAAA TCTCGCTCCC GTGATAATCG AACATCCCAT CGATGTGGTG 12 0 

GTATCTAGGG GATCGCCAGC AACCCTCAAC TGTGGTGCAA AGCCATCTAC CGCCAAAATC 180 
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ACATGGTACA 

AGGATGGACA 

GCCCGTAATC 

ACGAATAAGG 

AGCAAGTGAA 

CAGCCACCGG 

240 

ATTGTTCTCG 

ACACGGGATC 

CCTGTTTCTT 

CTGAAAGTGA 

ATAGTGGAAA 

AAACGGAAAA 

300 

GACAGCGATG 

CGGGAGCGTA 

CTATTGTGTG 

GCCAGCAACG 

AGCACGGAGA 

AGTGAAGTCG 

360 

AACGAAGGAT 

CGTTAAAATT 

GGCGATGCTT 

CGCGAAGACT 

TTCGAGTTCG 

GCCAAGAACA 

420 

GTTCAGGCTC 

TTGGTGGAGA 

GATGGCCGTT 

CTGGAATGCA 

GTCCGCCACG 

TGGATTCCCG 

480 

GAGCCGGTTG 

TGAGCTGGCG 

GAAAGACGAC 

AAAGAGCTCC 

GAATTCAAGA 

CATGCCACGA 

540 

TACACTCTAC 

ACTCTGACGG 

AAAC CT CATC 

ATTGATCCGG 

TCGATCGAAG 

CGATTCTGGT 

600 

ACTTATCAGT 

GTGTTGCCAA 

CAACATGGTC 

GGAGAACGGG 

TGTCCAATCC 

CGCAAGATTG 

660 

AGTGTCTTTG 

AG AAAC C AAA 

GTTTGAGCAA 

GAACCCAAGG 

ACATGACGGT 

CGACGTCGGA 

720 

GCCGCAGTGC 

TGTTTGATTG 

TCGTGTGACT 

GGAGATCCTC 

AACCACAAAT 

TACGTGGAAA 

780 

CGCAAAAATG 

AGCCGATGCC 

AGTTACACGT 

GCATACATTG 

CCAAGGATAA 

TCGGGGGTTG 

840 

AGAATCGAAA 

GAGTTCAACC 

ATCAGACGAA 

GGTGAATACG 

TTTGCTATGC 

ACGAAATCCA 

900 

GCGGGAACTC 

TTG AAG CAT C 

TGCACATCTT 

CGTGTCCAGG 

CACCTCCATC 

CTTCCAGACA 

960 

AAAC C AG C AG 

ACCAGTCAGT 

TCCAGCTGGA 

GGCACGGCAA 

CTTTTGAATG 

CACCTTGGTC 

1020 

GGTCAACCGA 

GTCCCGCCTA 

TTTTTGGAGC 

AAGGAAGGCC 

AACAGGATCT 

TCTTTTCCCA 

1080 

AGTTATGTGT 

CCGCTGATGG 

TAGAACGAAA 

GTTTCACCAA 

CTGGAACATT 

GACAATTGAG 

1140 

GAAGTTCGTC 

AAGTTGATGA 

GGGAGCTTAT 

GTGTGCGCTG 

GAATGAACTC 

GGCAGGAAGC 

1200 

TCGTTGAGCA 

AGGCAGCTTT 

GAAAGCAACA 

TTTGAAACCA 

AAGGCCGTGT 

CCAAAAAAAA 

1260 

AAGAGCAAAA 

TGGGCAAACA 

GAAACAAAAA 

AATGTTCAAT 

CAATTATCAA 

ATATTTAATT 

1320 

TCAGCCGTGA 

C CGG AAAC AC 

ACCCGCCAAA 

CCACCACCAA 

C AATCG AG C A 

TGGTCATCAA 

1380 

AATCAGACCC 

TTATGGTTGG 

ATCATCAGCC 

ATCCTTCCAT 

GTCAGGCTAG 

CGGAAAACCA 

1440 

ACTCCAGGAA 

TATCATGGCT 

CAGGGATGGG 

CTACCTATTG 

ACATTACAGA 

TAGTCGTATC 

1500 

AGTCAACATT 

CAACGGGAAG 

TCTACATATT 

GCCGATTTAA 

AGAAACCTGA 

CACCGGAGTT 

1560 

TACACTTGCA 

TTGCGAAGAA 

CGAGGATGGA 

GAGTCAACAT 

GGTCGGCATC 

TCTGACTGTT 

1620 

GAAGATCACA 

CTAGCAATGC 

ACAATTTGTT 

CGGATGCCGG 

ATCCATCGAA 

CTTCCCGTCT 

1680 

TCTCCAACGC 

AACCCATTAT 

TGTCAATGTC 

ACTGATACCG 

AAGTAGAGCT 

CCACTGGAAT 

1740 

GCTCCCTCCA 

CATCTGGCGC 

AGGACCAATC 

ACTGGTTATA 

TCATTCAGTA 

CTACAGTCCA 

1800 

GACCTCGGAC 

AGACGTGGTT 

TAACATTCCA 

GACTACGTGG 

CATCTACTGA 

ATATAGAATA 

1860 

AAGGGTCTGA 

AACCATCTCA 

CTCGTATATG 

TTTGTGATTC 

GAGCAGAAAA 

TGAGAAAGGT 

1920 

ATTGGAACGC 

CGAGTGTGTC 

GTCGGCTCTC 

GTTACCACTA 

G C AAG C C AG C 

AGCTCAAGTT 

1980 

GCGCTTTCTG 

ACAAGAACAA 

AATGGACATG 

GCCATCGCTG 

AGAAGAGACT 

CACTTCGGAA 

2040 

CAACTCATAA 

AACTCGAGGA 

AG TG AAG AC T 

ATTAATTCTA 

CGGCCGTTCG 

TTTGTTCTGG 

2100 

AAGAAGAGGA 

AACTTGAAGA 

GCTGATTGAT 

GGTTACTACA 

TCAAGTGGAG 

AGGGCCTCCA 

2160 

AGAACCAATG 

ATAATCAATA 

CGTGAATGTG 

ACCAGCCCTA 

GCACCGAAAA 

CTATGTTGTT 

2220 

TCAAATTTAA 

TGCCATTCAC 

CAACTATGAG 

TTTTTCGTGA 

TTCCTTATCA 

TTCCGGAGTT 

2280 

CATAGTATTC 

ATGGAGCACC 

GAGTAATTCC 

ATGGACGTGT 

TGACCGCCGA 

AGCTCCACCT 

2340 

TCATTGCCAC 

CAGAGGATGT 

GCGAATCCGT 

ATGCTCAACC 

TGACCACTCT 

TCGTATCTCT 

2400 

TGGAAAGCAC 

CAAAAGCCGA 

CGG CATC AAC 

GGAATTCTCA 

AAGGATTCCA 

AATTGTTATT 

2460 
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GTTGGTCAAG 

CGCCCAACAA 

CAATCGGAAC 

ATCACTACAA 

ACGAGAGAGC 

TGCCAGTGTT 

2520 

ACTCTGTTCC ' 

ATTTAGTGAC 

TGGAATGACG 

TATAAAATTC 

GTGTAGCGGC 

TAGAAGCAAT 

2580 

GGTGGAGTTG 

GAGTCTCACA 

TGGAACGAGT 

GAAGTCATCA 

TGAATCAAGA 

CACGCTGGAA 

2640 

AAACACCTTG 

CTGCTCAACA 

AGAAAACGAA 

TCATTTTTGT 

ATGGGCTGAT 

CAATAAATCT 

2700 

CATGTTCCTG 

TGATTGTCAT 

TGTTGCAATT 

CTGATTATTT 

TCGTAGTCAT 

CATTATAGCC 

2760 

TATTGTTACT 

GGAGGAATAG 

CAGAAACAGT 

GATGGAAAGG 

ATCGAAGTTT 

TATAAAGATC 

2820 

AATGATGGAA 

GTGTTCATAT 

GGCTTCGAAT 

AATCTTTGGG 

ATGTTG C AC A 

AAATCCGAAT 

2880 

CAGAATCCAA 

TGTACAACAC 

TGCTGGAAGA 

ATGACTATGA 

ACAATAGAAA 

TGGCCAGGCT 

2940 

CTCTATTCGC 

TG AC AC C AAA 

TGCGCAAGAC 

TTTTTCAACA 

ATTGTGATGA 

CTACAGTGGA 

3000 

ACGATGCACA 

GACCAGGATC 

CGAGCATCAC 

TATCATTATG 

CTCAACTGAC 

TGGCGGACCT 

3060 

GGTAATGCGA 

TGTCTACTTT 

TTATGGAAAC 

CAATATCACG 

ATGATCCATC 

TCCATATGCC 

3120 

ACCACAACAC 

TGGTCCTGTC 

GAACCAACAA 

CCAGCTTGGC 

TCAATGACAA 

AATGCTTCGC 

3180 

GCGCCAGCAA 

TGCCAACAAA 

TCCCGTGCCA 

CCAGAGCCAC 

CGGCGCGATA 

TGCAGATCAT 

3240 

AC CGCTGG AA 

GACGATCTCG 

ATCGAGCCGT 

GCATCCGATG 

GGAGAGGAAC 

TCTGAATGGC 

3 3 00 

GGACTCCATC 

ACCGGACTAG 

CGGAAGTCAA 

CGGTCGGATA 

GTCCACCTCA 

CACAGATGTG 

3360 

AGCTATGTTC 

AGCTTCACTC 

ATCCGATGGA 

ACTGGTAGTA 

GTAAGGAAAG 

AACTGGGGAG 

3420 

CGGAGAACAC 

CACCGAATAA 

GACTCTGATG 

GACTTTATTC 

CGCCACCACC 

TTCCAATCCA 

3480 

CCACCACCTG 

GAGGGCACGT 

TTATGACACA 

GCAACTAGGC 

GTCAGTTGAA 

TCGTGGAAGT 

3540 

ACTCCACGAG 

AAGACACCTA 

CGATTCGGTC 

AGTGACGGAG 

CTTTTGCTCG 

GGTTGATGTG 

3600 

AATGCAAGGC 

CAACGAGTCG 

GAATCGGAAT 

TTGGGAGGAA 

GGCCGCTGAA 

AGGGAAACGA 

3660 

G ACG AC GAT A 

GTCAGCGGTC 

TTCGTTGATG 

ATGGACGATG 

ATGGTGGATC 

TTCTGAAGCT 

3720 

GACGGGGAGA 

ACTCTGAAGG 

AGACGTTCCG 

CGTGGAGGTG 

TTAGAAAAGC 

AGTTCCTCGA 

3780 

ATGGGTATCT 

CTGCAAGTAC 

GCTGGCTCAT 

AGTTGTTACG 

GGACAAACGG 

CACTGCTCAA 

3840 

CGATTCCGGT 

CAATTCCACG 

TAACAATGGA 

ATCGTCACAC 

AAGAACAAAC 

TTGA 

3894 


(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 97 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 

Met Tyr Tyr Leu Gly Phe Tyr His Thr His Thr His Thr His Thr Tyr 

15 10 15 

lie Asn Phe Asp Lys lie Pro Asn Ala Ser Asn Leu Ala Pro Val lie 

20 25 30 

lie Glu His Pro lie Asp Val Val Val Ser Arg Gly Ser Pro Ala Thr 
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35 

Leu Asn Cys Gly Ala 
50 

Asp Gly Gin Pro Val 
65 

lie Val Leu Asp Thr 
85 

Lys Asn Gly Lys Asp 
100 

Asn Glu His Gly Glu 
115 

Met Leu Arg Glu Asp 
130 

Gly Gly Glu Met Ala 
145 

Glu Pro Val Val Ser 
165 

Asp Met .Pro Arg Tyr 
180 

Pro Val Asp Arg Ser 
195 

Met Val Gly Glu Arg 
210 

Lys Pro Lys Phe Glu 
225 

Ala Ala Val Leu Phe 
245 

lie Thr Trp Lys Arg 
260 

lie Ala Lys Asp Asn 
275 

Asp Glu Gly Glu Tyr 
290 

Glu Ala Ser Ala His 
305 

Lys Pro Ala Asp Gin 
325 

Cys Thr Leu Val Gly 
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Lys Pro Ser Thr Ala 
55 

lie Thr Asn Lys Glu 
70 

Gly Ser Leu Phe Leu 
90 

Ser Asp Ala Gly Ala 
105 

Val Lys Ser Asn Glu 
120 

Phe Arg Val Arg Pro 
135 

Val Leu Glu Cys Ser 
150 

Trp Arg Lys Asp Asp 
170 

Thr Leu His Ser Asp 
185 

Asp Ser Gly Thr Tyr 
200 

Val Ser Asn Pro Ala 
215 

Gin Glu Pro Lys Asp 
230 

Asp Cys Arg Val Thr 
250 

Lys Asn Glu Pro Met 
265 

Arg Gly Leu Arg lie 
280 

Val Cys Tyr Ala Arg 
295 

Leu Arg Val Gin Ala 
310 

Ser Val Pro Ala Gly 
330 

Gin Pro Ser Pro Ala 
54 


45 

Lys lie Thr Trp Tyr 
60 

Gin Val Asn Ser His 
75 

Leu Lys Val Asn Ser 
95 

Tyr Tyr Cys Val Ala 
110 

Gly Ser Leu Lys Leu 
125 

Arg Thr Val Gin Ala 
140 

Pro Pro Arg Gly Phe 
155 

Lys Glu Leu Arg lie 
175 

Gly Asn Leu lie lie 
190 

Gin Cys Val Ala Asn 
205 

Arg Leu Ser Val Phe 
220 

Met Thr Val Asp Val 
235 

Gly Asp Pro Gin Pro 
255 

Pro Val Thr Arg Ala 
270 

Glu Arg Val Gin Pro 
285 

Asn Pro Ala Gly Thr 
300 

Pro Pro Ser Phe Gin 
315 

Gly Thr Ala Thr Phe 
335 

Tyr Phe Trp Ser Lys 


Lys 

Arg 

80 

Gly 

Ser 

Ala 

Leu 

Pro 
160 
Gin 

Asp 

Asn 

Glu 

Gly 
240 
Gin 

Tyr 

Ser 

Leu 

Thr 
320 
Glu 

Glu 
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340 

Gly Gin Gin Asp Leu 
355 

Thr Lys Val Ser Pro 
'370 

Val Asp Glu Gly Ala 
385 

Ser Leu Ser Lys Ala 
405 

Val Gin Lys Lys Lys 
420 

Gin Ser lie lie Lys 
435 

Ala Lys Pro Pro Pro 
450 

Met Val Gly Ser Ser 
465 

Thr Pro Gly lie Ser 
485 

Asp Ser Arg lie Ser 
500 

Leu Lys Lys Pro Asp 
515 

Asp Gly Glu Ser Thr 
530 

Ser Asn Ala Gin Phe 
545 

Ser Pro Thr Gin Pro 
565 

Leu His Trp Asn Ala 
580 

Tyr lie lie Gin Tyr 
595 

lie Pro Asp Tyr Val 
610 

Pro Ser His Ser Tyr 
625 

lie Gly Thr Pro Ser 


. 345 

Leu Phe Pro Ser Tyr 
360 

Thr Gly Thr Leu Thr 
375 

Tyr Val Cys Ala Gly 
390 

Ala Leu Lys Ala Thr 
410 

Ser Lys Met Gly Lys 
425 

Tyr Leu lie Ser Ala 
440 

Thr lie Glu His Gly 

455 

Ala lie Leu Pro Cys 
470 

Trp Leu Arg Asp Gly 
490 

Gin His Ser Thr Gly 
505 

Thr Gly Val Tyr Thr 
520 

Trp Ser Ala Ser Leu 
535 

Val Arg Met Pro Asp 
550 

lie lie Val Asn Val 
570 

Pro Ser Thr Ser Gly 
585 

Tyr Ser Pro Asp Leu 
600 

Ala Ser Thr Glu Tyr 
615 

Met Phe Val lie Arg 
630 

Val Ser Ser Ala Leu 
55 


350 

Val Ser Ala Asp Gly 
365 

lie Glu Glu Val Arg 
380 

Met Asn Ser Ala Gly 
395 

Phe Glu Thr Lys Gly 
415 

Gin Lys Gin Lys Asn 
430 

Val Thr Gly Asn Thr 
445 

His Gin Asn Gin Thr 
460 

Gin Ala Ser Gly Lys 
475 

Leu Pro lie Asp lie 
495 

Ser Leu His lie Ala 
510 

Cys lie Ala Lys Asn 
525 

Thr Val Glu Asp His 
540 

Pro Ser Asn Phe Pro 
555 

Thr Asp Thr Glu Val 
575 

Ala Gly Pro lie Thr 
590 

Gly Gin Thr Trp Phe 
605 

Arg He Lys Gly Leu 
620 

Ala Glu Asn Glu Lys 
635 

Val Thr Thr Ser Lys 


Arg 

Gin 

Ser 
400 
Arg 

Val 

Pro 

Leu 

Pro 
480 
Thr 

Asp 

Glu 

Thr 

Ser 
560 
Glu 

Gly 

Asn 

Lys 

Gly 
640 
Pro 
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Ala Ala Gin Val 
660 

Ala Glu Lys Arg 
675 

Lys Thr lie Asn 
690 

Leu Glu Glu Leu 
705 

Arg Thr Asn Asp 

Asn Tyr Val Val 
740 

Val lie Pro Tyr 
755 

Asn Ser Met Asp 
770 

Glu Asp Val Arg 
785 

Trp Lys Ala Pro 

Gin lie Val lie 
820 

Thr Asn Glu Arg 
835 

Met Thr Tyr Lys 
850 

Val Ser His Gly 
865 

Lys His Leu Ala 

lie Asn Lys Ser 
900 

He Phe Val Val 
915 

Asn Ser Asp Gly 
930 

Val His Met Ala 


645 

Ala Leu Ser Asp 

Leu Thr Ser Glu 
680 

Ser Thr Ala Val 
695 

He Asp Gly Tyr 
710 

Asn Gin Tyr Val 
725 

Ser Asn Leu Met 

His Ser Gly Val 
760 

Val Leu Thr Ala 
775 

He Arg Met Leu 
790 

Lys Ala Asp Gly 
805 

Val Gly Gin Ala 

Ala Ala Ser Val 
840 

He Arg Val Ala 
855 

Thr Ser Glu Val 
870 

Ala Gin Gin Glu 
885 

His Val Pro Val 

He He He Ala 
920 

Lys Asp Arg Ser 
935 

Ser Asn Asn Leu 


650 

Lys Asn Lys Met 
665 

Gin Leu He Lys 

Arg Leu Phe Trp 
700 

Tyr He Lys Trp 
715 

Asn Val Thr Ser 
730 

Pro Phe Thr Asn 
745 

His Ser He His 

Glu Ala Pro Pro 
780 

Asn Leu Thr Thr 
795 

He Asn Gly He 
810 

Pro Asn Asn Asn 
825 

Thr Leu Phe His 

Ala Arg Ser Asn 
860 

He Met Asn Gin 
875 

Asn Glu Ser Phe 
890 

He Val He Val 
905 

Tyr Cys Tyr Trp 

Phe He Lys He 
940 

Trp Asp Val Ala 


655 

Asp Met Ala He 
670 

Leu Glu Glu Val 
685 

Lys Lys Arg Lys 

Arg Gly Pro Pro 
720 

Pro Ser Thr Glu 
735 

Tyr Glu Phe Phe 
750 

Gly Ala Pro Ser 
765 

Ser Leu Pro Pro 

Leu Arg He Ser 
800 

Leu Lys Gly Phe 
815 

Arg Asn He Thr 
830 

Leu Val Thr Gly 
845 

Gly Gly Val Gly 

Asp Thr Leu Glu 
880 

Leu Tyr Gly Leu 
895 

Ala He Leu He 
910 

Arg Asn Ser Arg 
925 

Asn Asp Gly Ser 
Gin Asn Pro Asn 


945 950 955 960 

Gin Asn Pro Met Tyr Asn Thr Ala Gly Arg Met Thr Met Asn Asn Arg 

965 970 975 

Asn Gly Gin Ala Leu Tyr Ser Leu Thr Pro Asn Ala Gin Asp Phe Phe 

980 985 990 

Asn Asn Cys Asp Asp Tyr Ser Gly Thr Met His Arg Pro Gly Ser Glu 

995 1000 1005 

His His Tyr His Tyr Ala Gin Leu Thr Gly Gly Pro Gly Asn Ala Met 

1010 1015 1020 

Ser Thr Phe Tyr Gly Asn Gin Tyr His Asp Asp Pro Ser Pro Tyr Ala 
1025 1030 1035 1040 

Thr Thr Thr Leu Val Leu Ser Asn Gin Gin Pro Ala- Trp Leu Asn Asp 

1045 1050 1055 

Lys Met Leu Arg Ala Pro Ala Met Pro Thr Asn Pro Val Pro Pro Glu 

1060 1065 1070 

Pro Pro Ala Arg Tyr Ala Asp His Thr Ala Gly Arg Arg Ser Arg Ser 

1075 1080 1085 

Ser Arg Ala Ser Asp Gly Arg Gly Thr Leu Asn Gly Gly Leu His His 

1090 1095 1100 

Arg Thr Ser Gly Ser Gin Arg Ser Asp Ser Pro Pro His Thr Asp Val 
1105 1110 1115 1120 

Ser Tyr Val Gin Leu His Ser Ser Asp Gly Thr Gly Ser Ser Lys Glu 

1125 1130 1135 

Arg Thr Gly Glu Arg Arg Thr Pro Pro Asn Lys Thr Leu Met Asp Phe 

1140 1145 1150 

lie Pro Pro Pro Pro Ser Asn Pro Pro Pro Pro Gly Gly His Val Tyr 

1155 1160 1165 

Asp Thr Ala Thr Arg Arg Gin Leu Asn Arg Gly Ser Thr Pro Arg Glu 

1170 1175 1180 

Asp Thr Tyr Asp Ser Val Ser Asp Gly Ala Phe Ala Arg Val Asp Val 
1185 1190 1195 1200 

Asn Ala Arg Pro Thr Ser Arg Asn Arg Asn Leu Gly Gly Arg Pro Leu 

1205 1210 1215 

Lys Gly Lys Arg Asp Asp Asp Ser Gin Arg Ser Ser Leu Met Met Asp 

1220 1225 1230 

Asp Asp Gly Gly Ser Ser Glu Ala Asp Gly Glu Asn Ser Glu Gly Asp 

1235 1240 1245 

Val Pro Arg Gly Gly Val Arg Lys Ala Val Pro Arg Met Gly lie Ser 
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1250 1255 1260 

Ala Ser Thr Leu Ala His Ser Cys Tyr Gly Thr Asn Gly Thr Ala Gin 
1265 1270 1275 1280 

Arg Phe Arg Ser lie Pro Arg Asn Asn Gly lie Val Thr Gin Glu Gin 
1285 1290 1295 

Thr 

(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4956 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 


ATGAAATGGA 

AACATGTTCC 

TTTTTTGGTC 

ATGATATCAC 

TCCTCAGCTT 

ATCCCCAAAT 

60 

CACCTGTTTC 

TGGCCCAGCT 

TATTCCAGAC 

CCTGAAGATG 

TAGAGAGGGG 

GAACGACCAC 

120 

GGGACGCCAA 

TCCCCACCTC 

TGATAACGAT 

GACAATTCGC 

TGGGCTATAC 

AGGCTCCCGT 

180 

CTTCGTCAGG 

AAGATTTTCC 

ACCTCGCATT 

GTTGAACACC 

CTTCAGa4;CT 

GATTGTCTCA 

240 

AAAGGAGAAC 

CTGCAACTTT 

GAACTGCAAA 

GCTGAAGGCC 

GCCCCACACC 

CACTATTGAA 

300 

TGGTACAAAG 

GGGGAGAGAG 

AGTGGAGACA 

GACAAAGATG 

ACCCTCGCTC 

ACACCGAATG 

360 

TTGCTGCCGA 

GTGGATCTTT 

ATTTTTCTTA 

CGTATAGTAC 

ATGGACGGAA 

AAGTAGACCT 

420 

GATGAAGGAG 

TCTATGTCTG 

TGTAGCAAGG 

AATTACCTTG 

GAGAGGCTGT 

GAGCCACAAT 

480 

GCATCGCTGG 

AAGTAGCCAT 

A^TTCGGGAT 

GACTTCAGAC 

AAAACCCTTC 

GGATGTCATG 

540 

GTTGCAGTAG 

GAGAGCCTGC 

AGTAATGGAA 

TGCCAACCTC 

CACGAGGCCA 

TCCTGAGCCC 

600 

ACCATTTCAT 

GGAAGAAAGA 

TGGCTCTCCA 

CTGGATGATA 

AAGATGAAAG 

AATAACTATA 

660 

CGAGGAGGAA 

AGCTCATGAT 

CACTTACACC 

CGTAAAAGTG 

ACGCTGGCAA 

ATATGTTTGT 

720 

GTTGGTACCA 

ATATGGTTGG 

GGAACGTGAG 

AGTGAAGTAG 

CCGAGCTGAC 

TGTCT^AGAG 

780 

AGACCATCAT 

TTGTGAAGAG 

ACCCAGTAAC 

TTGGCAGTAA 

CTGTGGATGA 

CAGTGCAGAA 

840 

TTTAAATGTG 

AGGCCCGAGG 

TGACCCTGTA 

CCTACAGTAC 

GATGGAGGAA 

AGATGATGGA 

900 

GAGCTGCCCA 

AAT C C AG AT A 

TGAAATCCGA 

GATGATCATA 

CCTTGAAAAT 

TAGGAAGGTG 

960 

ACAGCTGGTG 

ACATGGGTTC 

ATACACTTGT 

GTTGCAGAAA 

ATATGGTGGG 

CAAAGCTGAA 

1020 

GCATCTGCTA 

CTCTGACTGT 

TCAAGAAgCT CCACATTTTG 

TTGTGAAACC 

CCGTGACCAG 

1080 

GTTGTTGCTT 

TGGGACGGAC 

TGTAACTTTT 

CAGTGTGAAG 

CAACCGGAAA 

TCCTCAACCA 

1140 

GCTATTTTCT 

GGAGGAGAGA 

AGGGAGTCAG 

AATCTACTTT 

TCTCATATCA 

ACCACCACAG 

1200 

TCATCCAGCC 

GATTTTCAGT 

CTCCCAGACT 

GGCGACCTCA 

CAATTACTAA 

TGTCCAGCGA 

1260 

TCTGATGTTG 

GTTATTACAT 

CTGCCAGACT 

TTAAATGTTG 

CTGGAAGCAT 

CATCACAAAG 

1320 

GCATATTTGG 

AAGTTACAGA 

tgtgattgca; GATCGGCCTC 

CCCCAGTTAT 

TCGACAAGGT 

1380 
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CCTGTGAATC 

AGACTGTAGC 

CGTGGATGGC 

ACTTTCGTCC 

TCAGCTGTGT 

GGCCACAGGC 

1440 

AGTCCAGTGC 

CCACCATTCT 

GTGGAGAAAG 

GATGGAGTCC 

TCGTTTCAAC 

CCAAGACTCT 

1500 

CGAATCAAAC 

AGTTGGAGAA 

TGGAGTACTG 

CAGATCCGAT 

ATGCTAAGCT 

GGGTGATACT 

1560 

GGTCGGTACA 

CCTGCATTGC 

ATCAACCCCC 

AGTGGTGAAG 

CAACATGGAG 

TGCTTACATT 

1620 

GAAGTTCAAG 

aatttgJgagt 

TCCAGTTCAG 

CCTCCAAGAC 

CTACTGACCC 

AAATTTAATC 

1680 

CCTAGTGCCC 

CATCAAAACC 

TGAAGTGACA 

GATGTCAGCA 

GAAATACAGT 

CACATTATCG 

1740 

TGGCAACCAA 

ATTTGAATTC 

AGGAGCAACT 

CCAACATCTT 

ATATTATAGA 

AGCCTTCAGC 

1800 

CATGCATCTG 

GTAGCAGCTG 

GCAGACCGTA 

GCAGAGAATG 

TGAAAACAGA 

AACATCTGCC 

1860 

ATTAAAGGAC 

TCAAACCTAA 

TGCAATTTAC 

CTTTTCCTTG 

TGAGGGCAGC 

TAATGCATAT 

1920 

GGAATTAGTG 

ATCCAAGCCA 

AATATCAGAT 

CCAGTGAAAA 

CACAAGATGT 

CCTACCAACA 

1980 

AGTCAGGGGG 

TGGACCACAA 

GCAGGTCCAG 

AGAGAGCTGG 

GAAATGCTGT 

TCTGCACCTC 

2040 

CACAACCCCA 

CCGTCCTTTC 

TTCCTCTTCC 

ATCGAAGTGC 

ACTGGACAGT 

AGATCAACAG 

2100 

TCTCAGTATA 

TACAAGGATA 

TAAAATTCTC 

TATCGGCCAT 

CTGGAGCCAA 

CCACGGAGAA 

2160 

TCAGACTGGT 

TAGTTTTTGA 

AGTGAGGACG 

CCAGCCAAAA 

ACAGTGTGGT 

AATCCCTGAT 

2220 

CTCAGAAAGG 

GAGTCAACTA 

TGAAATTAAG 

GCTCGCCCTT 

TTTTTAATGA 

ATTTCAAGGA 

2280 

GCAGATAGTG 

AAATCAAGTT 

TGCCAAAACC 

CTGGAAGAAG 

CACCCAGTGC 

CCCACCCCAA 

2340 

GGTGTAACTG 

TATCCAAGAA 

TGATGGAAAC 

GGAACTGCAA 

TTCTAGTTAG 

TTGGCAGCCA 

2400 

CCTCCAGAAG 

ACACTCAAAA 

TGGAATGGTC 

CAAGAGTATA 

AGGTTTGGTG 

TCTGGGCAAT 

2460 

GAAACTCGAT 

ACCACATCAA 

CAAAACAGTG 

GATGGTTCCA 

CCTTTTCCGT 

GGTCATTCCC 

2520 

TTTCTTGTTC 

CTGGAATCCG 

ATACAGTGTG 

GAAGTGGCAG 

CCAGCACTGG 

GGCTGGGTCT 

2580 

GGGGTAAAGA 

GTGAGCCTCA 

GTTCATCCAG 

CTGGATGCCC 

ATGGAAACCC 

TGTGTCACCT 

2640 

GAGGACCAAG 

TCAGCCTCGC 

TCAGCjAGATT 

TCAGATGTGG 

TGAAGCAGCC 

GGCCTTCATA 

2700 

GCAGGTATTG 

GAGCAGCCTG 

TTGGATCATC 

CTCATGGTCT 

TCAGCATCTG 

GCTTTATCGA 

2760 

caccg'caaga 

AGAGAAACGG 

ACTTACTAGT 

ACCTACGCGG 

GTATCAGAAA 

AGTCCCGTCT 

2820 

TTTACCTTCA 

CACCAACAGT 

AACTTACCAG 

AGAGGAGGCG 

AAGCTGTCAG 

CAGTGGAGGG 

2880 

AGGCCTGGAC 

TTCTCAACAT 

CAGTGAACCT 

GCCGCGCAGC 

CATGGCTGGC 

AGACACGTGG 

2940 

CCTAATACTG 

GCAACAACCA 

CAATGACTGC 

TCCATCAGCT 

GCTGCACGGC 

AGGCAATGGA 

3000 

AACAGCGACA 

GCAACCTCAC 

TACCTACAGT 

CGCCCAGCTG 

ATTGTATAGC 

AAATTATAAC 

3060 

AACCAACTGG 

ATAACAAACA 

AACAAATCTG 

ATGCTCCCTG 

AGTCAACTGT 

TTATGGTGAT 

3120 

GTGGACCTTA 

GTAACAAAAT 

CAATGAGATG 

AAAACCTTCA 

AT AG C C C AAA 

JTCTGAAGGAT 

3180 

GGGCGTTTTG 

TCAATCCATC 

AGGGCAGCCT 

ACTCCTTACG 

CCACCACTCA 

GCTCATCCAG 

3240 

TCAAACCTCA 

GCAACAACAT 

GAACAATGGC 

AGCGGGGACT 

CTGGCGAGAA 

GCACTGGAAA 

3300 

CCACTGGGAC 

AG C AG AAAC A 

AGAAGTGGCA 

CCAGTTCAGT 

ACAACATCGT 

GG AG CAAAAC 

3360 

AAGCTGAACA 

AAGATTATCG 

AGCAAATGAC 

ACAGTTCCTC 

CAACTATCCC 

ATACAACCAA 

3420 

TCATACGACC 

AGAACACAGG 

AGGATCCTAC 

AACAGCTCAG 

ACCGGGGCAG 

TAG T AC AT C T 

3480 

GGGAGTCAGG 

GGCACAAGAA 

AGGGGCAAGA 

ACACCCAAGG 

TACCAAAACA 

GGGTGGCATG 

3540 

AACTGGGCAG 

ACCTGCTTCC 

TCCTCCCCCA 

GCACATCCTC 

CTCCACACAG 

CAATAGCGAA 

3600 

GAGTACAACA 

TTTCTGTAGA 

T G AAAG C T AT 

GACCAAGAAA 

TGCCATGTCC 

CGTGCCACCA 

3660 
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GCAAGGATGT 

ATTTGCAACA 

AGATGAATTA 

GAAGAGGAGG 

AAGATGAACG 

AGGCCCCACT 

3720 

CCCCCTGTTC 

GGGGAGCAGC 

TTCTTCTCCA 

GCTGCCGTGT 

CCTATAGCCA 

TCAGTCCACT 

3780 

GCCACTCTGA 

CTCCCTCCCC 

ACAGGAAGAA 

CTtcAGCQCA 

TGTTACAGGA 

TTGTCCAGAG 

3840 

GAGACTGGCC 

ACATG C AG C A 

CCAGCCqGAC 

AGGAGACGGC 

AGCCTGTGAG 

TCCTCCTCCA 

3900 

CCACCACGGC 

CGATCTCCCC 

TCCACATACC 

TATGGCTACA 

TTTCAGGACC 
CCGACATGGA 

CCTGGTCTCA 

3960 

GATATGGATA 

CGGATGCGCC 

AGAAGAGGAA 

GAAGACGAAG 

GGTAGCCAAG 

4020 

ATGCAAACCA 

GAAGGCTTTT 

GTTACGTGGG 

CTTGAGCAGA 

CACCTGCCTC 

CAG TGTTGGG 

4080 

GAC CTGG AG A 

GCTCTGTCAC 

GGGGTCCATG 

ATCAACGGCT 

GGGGCTCAGC 

CTCAGAGGAG 

4140 

GACAACATTT 

CCAGCGGACG 

CTCCAGTGTT 

AGTTCTTCGG 

ACGGCTCCTT 

TTTCACTGAT 

4200 

GCTGACTTTG 

CCCAGGCAGT 

CGCAGCAGCG 

GCAGAGTATG 

CTGGTCTGAA 

AGTAGCACGA 

4260 

CGGCAAATGC 

AGGATGCTGC 

TGGCCGTCGA 

CATTTTCATG 

CGTCTCAGTG 

CCCTAGGCCC 

4320 

ACAAGTCCCG 

TGT C T AC AG A 

CAGCAACATG 

AGTGCCGCCG 

TAATGCAGAA 

AAC CAG AtC A 

4380 

GCCAAGAAAC 

TGAAACACCA 

GCCAGGACAT 

CTGCGCAGAG 

AAACCTACAC 

AGATGATCTT 

4440 

CCACCACCTC 

CTGTGCCGCC 

ACCTGCTATA 

AAGTCACCTA 

CTGCCCAATC 

CAAGAC^CAG 

4500 

CTGGAAGTAC 

GACCTGTAGT 

GGTGCCAAAA 

CTCCCTTCTA 

TGGATGCAAG 

AACAGACAGA] 

4560 

TCATCAGACA 

GAAAAGGAAG 

CAGTTACAAG 

GGGAGAGAAG 

TGTTGGATGG 

AAGACAGGTT 

4620 

GTTGACATGC 

GAACAAATCC 

AGGTGATCCC 

AGAGAAGCAC 

AGGAACAGCA 

AAATGACGGG 

4680 

AAAGGACGTG 

GAAACAAGGC 

AGCAAAACGA 

GACCTTCCAC 

C AG C AAAG AC 

TCATCTCATC 

4740 

CAAGAGGATA 

TTCTACCTTA 

TTGTAGACCT 

ACTTTTCCAA 

CATCAAATAA 

TCCCAGAGAT 

4 800 

CCCAGTTCCT 

CAAGCTCAAT 

GTCATCAAGA 

GGATCAGGAA 

G C AG AC AAAG 

AGAACAAGCA 

4860 

AATGTAGGTC 

GAAGAAATAT 

TG C AG AAATG 

CAGGTACTTG 

GAGGATATGA 

AAGAGGAGAA 

4920 

GATAATAATG 

AAGAATTAGA 

GGAAACTGAA 

AGCTGA 



4956 


(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1651 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

Met Lys Trp Lys His Val Pro Phe Leu Val Met lie Ser Leu Leu Ser 

1 5-10 15 

Leu Ser Pro Asn His Leu Phe Leu Ala Gin Leu lie Pro Asp Pro Glu 

2 0 2 5 3 0 

Asp Val Glu Arg Gly Asn Asp His Gly Thr Pro lie Pro Thr Ser Asp 

35 40 45 

Asn Asp Asp Asn Ser Leu Gly Tyr Thr Gly Ser Arg Leu Arg Gin Glu 
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50 

Asp Phe Pro Pro 
65 

Lys Gly Glu Pro 

Pro Thr lie Glu 
100 

Asp Asp Pro Arg 
115 

Phe Leu Arg lie 
130 

Tyr Val Cys Val 
145 

Ala Ser Leu Glu 

Ser Asp Val Met 
180 

Pro Pro Arg Gly 
195 

Ser Pro Leu Asp 
210 

Leu Met lie Thr 
225 

Val Gly Thr Asn 

Thr Val Leu Glu 
260 

Val Thr Val Asp 

275 

Pro Val Pro Thr 
290 

Ser Arg Tyr Glu 
305 

Thr Ala Gly Asp 

Gly Lys Ala Glu 
340 

Phe Val Val Lys 


55 

Arg lie Val Glu 
70 

Ala Thr Leu Asn 
85 

Trp Tyr Lys Gly 

Ser His Arg Met 
120 

Val His Gly Arg 
135 

Ala Arg Asn Tyr 
150 

Val Ala lie Leu 
165 

Val Ala Val Gly 

His Pro Glu Pro 
200 

Asp Lys Asp Glu 
215 

Tyr Thr Arg Lys 
230 

Met Val Gly Glu 
245 

Arg Pro Ser Phe 

Asp Ser Ala Glu 

280 

Val Arg Trp Arg 
295 

lie Arg Asp Asp 
310 

Met Gly Ser Tyr 
325 

Ala Ser Ala Thr 
Pro Arg Asp Gin 


60 

His Pro Ser Asp 
75 

Cys Lys Ala Glu 
90 

Gly Glu Arg Val 
105 

Leu Leu Pro Ser 

Lys Ser Arg Pro 
140 

Leu Gly Glu Ala 
155 

Arg Asp Asp Phe 
170 

Glu Pro Ala Val 
185 

Thr lie Ser Trp 

Arg lie Thr lie 
220 

Ser Asp Ala Gly 
235 

Arg Glu Ser Glu 
250 

Val Lys Arg Pro 
265 

Phe Lys Cys Glu 

Lys Asp Asp Gly 
300 

His Thr Leu Lys 
315 

Thr Cys Val Ala 
330 

Leu Thr Val Gin 
345 

Val Val Ala Leu 


Leu lie Val Ser 
80 

Gly Arg Pro Thr 
95 

Glu Thr Asp Lys 
110 

Gly Ser Leu Phe 
125 

Asp Glu Gly Val 

Val Ser His Asn 
160 

Arg Gin Asn Pro 
175 

Met Glu Cys Gin 
190 

Lys Lys Asp Gly 
205 

Arg Gly Gly Lys 

Lys Tyr Val Cys 
240 

Val Ala Glu Leu 
255 

Ser Asn Leu Ala 
270 

Ala Arg Gly Asp 
285 

Glu Leu Pro Lys 

lie Arg Lys Val 
320 

Glu Asn Met Val 
335 

Glu Pro Pro His 
350 

Gly Arg Thr Val 
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355 

Thr Phe Gin Cys Glu 
370 

Arg Arg Glu Gly Ser 
385 

Ser Ser Ser Arg Phe 
405 

Asn Val Gin Arg Ser 
420 

Val Ala Gly Ser lie 
435 

lie Ala Asp Arg Pro 
450 

Thr Val Ala Val Asp 
465 

Ser Pro Val Pro Thr 
485 

Thr Gin Asp Ser Arg 
500 

Arg Tyr Ala Lys Leu 
515 

Thr Pro Ser Gly Glu 
530 

Phe Gly Val Pro Val 
545 

Pro Ser Ala Pro Ser 
565 

Val Thr Leu Ser Trp 
580 

Ser Tyr lie lie Glu 
595 

Thr Val Ala Glu Asn 
610 

Lys Pro Asn Ala lie 

625 

Gly lie Ser Asp Pro 
645 

Val Leu Pro Thr Ser 


360 

Ala Thr Gly Asn Pro 
375 

Gin Asn Leu Leu Phe 
390 

Ser Val Ser Gin Thr 
410 

Asp Val Gly Tyr Tyr 
425 

lie Thr Lys Ala Tyr 
440 

Pro Pro Val lie Arg 
455 

Gly Thr Phe Val Leu 
470 

lie Leu Trp Arg Lys 
490 

lie Lys Gin Leu Glu 
505 

Gly Asp Thr Gly Arg 
520 

Ala Thr Trp Ser Ala 
535 

Gin Pro Pro Arg Pro 
550 

Lys Pro Glu Val Thr 
570 

Gin Pro Asn Leu Asn 
585 

Ala Phe Ser His Ala 
600 

Val Lys Thr Glu Thr 
615 

Tyr Leu Phe Leu Val 
630 

Ser Gin lie Ser Asp 
650 

Gin Gly Val Asp His 
62 


365 

Gin Pro Ala lie Phe 
380 

Ser Tyr Gin Pro Pro 
395 

Gly Asp Leu Thr lie 
415 

lie Cys Gin Thr Leu 
430 

Leu Glu Val Thr Asp 
445 

Gin Gly Pro Val Asn 
460 

Ser Cys Val Ala Thr 

475 

Asp Gly Val Leu Val 
495 

Asn Gly Val Leu Gin 
510 

Tyr Thr Cys lie Ala 
525 

Tyr lie Glu Val Gin 
540 

Thr Asp Pro Asn Leu 
555 

Asp Val Ser Arg Asn 
575 

Ser Gly Ala Thr Pro 
590 

Ser Gly Ser Ser Trp 
605 

Ser Ala lie Lys Gly 
620 

Arg Ala Ala Asn Ala 
635 

Pro Val Lys Thr Gin 
655 

Lys Gin Val Gin Arg 


Trp 

Gin 
400 
Thr 

Asn 

Val 

Gin 

Gly 
480 

Ser 

He 

Ser 

Glu 

He 
560 
Thr 

Thr 

Gin 

Leu 

Tyr 
640 
Asp 

Glu 
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660 

Leu Gly Asn Ala 
675 

Ser Ser lie Glu 
690 

Gin Gly Tyr Lys 
705 

Ser Asp Trp Leu 

Val lie Pro Asp 
740 

Pro Phe Phe Asn 
755 

Lys Thr Leu Glu 
770 

Ser Lys Asn Asp 
785 

Pro Pro Glu Asp 

Cys Leu Gly Asn 
820 

Ser Thr Phe Ser 
835 

Ser Val Glu Val 
850 

Glu Pro Gin Phe 
865 

Glu Asp Gin Val 

Pro Ala Phe lie 
900 

Val Phe Ser lie 
915 

Thr Ser Thr Tyr 
930 

Pro Thr Val Thr 
945 

Arg Pro Gly Leu 


Val Leu His Leu 
680 

Val His Trp Thr 
695 

lie Leu Tyr Arg 
710 

Val Phe Glu Val 
725 

Leu Arg Lys Gly 

Glu Phe Gin Gly 
760 

Glu Ala Pro Ser 
775 

Gly Asn Gly Thr 
790 

Thr Gin Asn Gly 
805 

Glu Thr Arg Tyr 

Val Val lie Pro 
840 

Ala Ala Ser Thr 
855 

lie Gin Leu Asp 
870 

Ser Leu Ala Gin 
885 

Ala Gly lie Gly 

Trp Leu Tyr Arg 
920 

Ala Gly lie Arg 
935 

Tyr Gin Arg Gly 
950 

Leu Asn lie Ser 


665 

His Asn Pro Thr 

Val Asp Gin Gin 
700 

Pro Ser Gly Ala 
715 

Arg Thr Pro Ala 
730 

Val Asn Tyr Glu 
745 

Ala Asp Ser Glu 

Ala Pro Pro Gin 
780 

Ala lie Leu Val 
795 

Met Val Gin Glu 
810 

His lie Asn Lys 
825 

Phe Leu Val Pro 

Gly Ala Gly Ser 
860 

Ala His Gly Asn 
875 

Gin He Ser Asp 
890 

Ala Ala Cys Trp 
905 

His Arg Lys Lys 

Lys Val Pro Ser 
940 

Gly Glu Ala Val 

955 

Glu Pro Ala Ala 


670 

Val Leu Ser Ser 
685 

Ser Gin Tyr He 

Asn His Gly Glu 
720 

Lys Asn Ser Val 
735 

lie Lys Ala Arg 
750 

He Lys Phe Ala 
765 

Gly Val Thr Val 

Ser Trp Gin Pro 
800 

Tyr Lys Val Trp 
815 

Thr Val Asp Gly 
830 

Gly He Arg Tyr 
845 

Gly Val Lys Ser 

Pro Val Ser Pro 
880 

Val Val Lys Gin 
895 

He He Leu Met 
910 

Arg Asn Gly Leu 
925 

Phe Thr Phe Thr 

Ser Ser Gly Gly 
960 

Gin Pro Trp Leu 


63 
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965 970 975 

Ala Asp Thr Trp Pro Asn Thr Gly Asn Asn His Asn Asp Cys Ser lie 
980 985 990 

Ser Cys Cys Thr Ala Gly Asn Gly Asn Ser Asp Ser Asn Leu Thr Thr 

995 1000 1005 

Tyr Ser Arg Pro Ala Asp Cys lie Ala Asn Tyr Asn Asn Gin Leu Asp 

1010 1015 1020 

Asn Lys Gin Thr Asn Leu Met Leu Pro Glu Ser Thr Val Tyr Gly Asp 
1025 1030 1035 1040 

Val Asp Leu Ser Asn Lys lie Asn Glu Met Lys Thr Phe Asn Ser Pro 

- 1045 1050 1055 

Asn Leu Lys Asp Gly Arg Phe Val Asn Pro Ser Gly Gin Pro Thr Pro 

1060 1065 1070 

Tyr Ala Thr Thr Gin Leu lie Gin Ser Asn Leu Ser Asn Asn Met Asn 

1075 1080 1085 

Asn Gly Ser Gly Asp Ser Gly Glu Lys His Trp Lys Pro Leu Gly Gin 

1090 1095 1100 

Gin Lys Gin Glu Val Ala Pro Val Gin Tyr Asn lie Val Glu Gin Asn 
1105 1110 1115 1120 

Lys Leu Asn Lys Asp Tyr Arg Ala Asn Asp Thr Val Pro Pro Thr lie 

1125 1130 1135 

Pro Tyr Asn Gin Ser Tyr Asp Gin Asn Thr Gly Gly Ser Tyr Asn Ser 

1140 1145 1150 

Ser Asp Arg Gly Ser Ser Thr Ser Gly Ser Gin Gly His Lys Lys Gly 

1155 1160 1165 

Ala Arg Thr Pro Lys Val Pro Lys Gin Gly Gly Met Asn Trp Ala Asp 

1170 1175 1180 

Leu Leu Pro Pro Pro Pro Ala His Pro Pro Pro His Ser Asn Ser Glu 
1185 1190 1195 1200 

Glu Tyr Asn lie Ser Val Asp Glu Ser Tyr Asp Gin Glu Met Pro Cys 

1205 1210 1215 

Pro Val Pro Pro Ala Arg Met Tyr Leu Gin Gin Asp Glu Leu Glu Glu 

1220 1225 1230 

Glu Glu Asp Glu Arg Gly Pro Thr Pro Pro Val Arg Gly Ala Ala Ser 

1235 1240 1245 

Ser Pro Ala Ala Val Ser Tyr Ser His Gin Ser Thr Ala Thr Leu Thr 

1250 1255 1260 

Pro Ser Pro Gin Glu Glu Leu Gin Pro Met Leu Gin Asp Cys Pro Glu 
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1265 1270 1275 1280 

Glu Thr Gly His Met Gin His Gin Pro Asp Arg Arg Arg Gin Pro Val 

1285 1290 1295 

Ser Pro Pro Pro Pro Pro Arg Pro He Ser Pro Pro His Thr Tyr Gly 

1300 1305 1310 

Tyr He Ser ^3ly Pro Leu Val Ser Asp Met Asp Thr Asp Ala Pro Glu 

1315 1320 1325 

Glu Glu Glu Asp Glu Ala Asp Met Glu Val Ala Lys Met Gin Thr Arg 

1330 1335 1340 

Arg Leu Leu Leu Arg Gly Leu Glu Gin Thr Pro Ala Ser Ser Val^/Gly 
1345 1350 1355 1360 

Asp Leu Glu Ser Ser Val Thr Gly Ser Met He Asn Gly Trp Gly Ser 

1365 1370 1375 

Ala Ser Glu Glu Asp Asn He Ser Ser Gly Arg Ser Ser Val Ser Ser 

1380 1385 1390 

Ser Asp Gly Ser Phe Phe Thr Asp Ala Asp Phe Ala Gin Ala Val Ala 

1395 1400 1405 

Ala Ala Ala Glu Tyr Ala Gly Leu Lys Val Ala Arg Arg Gin Met Gin 

1410 1415 1420 

Asp Ala Ala Gly Arg Arg His Phe His Ala Ser Gin Cys Pro Arg Pro 
1425 1430 1435 1440 

Thr Ser Pro Val Ser Thr Asp Ser Asn Met Ser Ala Ala Val Met Gin 

1445 1450 1455 

Lys Thr Arg Pro Ala Lys Lys Leu Lys His Gin Pro Gly His Leu Arg 

1460 1465 1470 

Arg Glu Thr Tyr Thr Asp Asp Leu Pro Pro Pro Pro Val Pro Pro Pro 

1475 1480 1485 

Ala He Lys Ser Pro Thr Ala Gin Ser Lys Thr Gin Leu Glu Val Arg 

1490 1495 1500 

Pro Val Val Val Pro Lys Leu Pro Ser Met Asp Ala Arg Thr Asp Arg 
1505 1510 1515 1520 

Ser Ser Asp Arg Lys Gly Ser Ser Tyr Lys Gly Arg Glu Val Leu Asp 

1525 1530 1535 

Gly Arg Gin Val Val Asp Met Arg Thr Asn Pro Gly Asp Pro Arg Glu 

1540 1545 1550 

Ala Gin Glu Gin Gin Asn Asp Gly Lys Gly Arg Gly Asn Lys Ala Ala 

1555 1560 1565 

Lys Arg Asp Leu Pro Pro Ala Lys Thr His Leu He Gin Glu Asp He 


B98-006 


i 

\ 

1570 1575 1580 

Leu Pro Tyr Cys Arg Pro Thr Phe Pro Thr Ser Asn Asn Pro Arg Asp 
1585 1590 1595 1600 

Pro Ser Ser Ser Ser Ser Met Ser Ser Arg Gly Ser Gly Ser Arg Gin 

1605 1610 1615 

Arg Glu Gin Ala Asn Val Gly Arg Arg Asn lie Ala Glu Met Gin Val 

1620 1625 1630 

Leu Gly Gly Tyr Glu Arg Gly Glu Asp Asn Asn Glu Glu Leu Glu Glu 

1635 1640 1645 

Thr Glu Ser 
1650 

(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1300 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/ KEY : misc_f eature 

(B) LOCATION: 855 1187 

(D) OTHER INFORMATION: /note= "N signifies gap in sequence" 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 


CAGATTGTTG 

CTCAAGGTCG 

AACAGTGACA 

TTTCCCTGTG 

AAACTAAAGG 

AAAC C C AC AG 

60 

CCAGCTGTTT 

TTTGGCAGAA 

AGAAGGCAGC 

CAGAACCTAC 

TTTTCCCAAA 

CCAACCCCAG 

120 

CAGCCCAACA 

GTAGATGCTC. 

AGTGTCACCA 

ACTGGAGACC 

TCACAATCAC 

CAACATTCAA 

180 

CGTTCCGACG 

CGGGTTACTA 

CATCTGCCAG 

GCTTTAACTG 

TGGCAGGAAG 

CATTTTAGCA 

240 

AAAGCTC^AC-JT.GGAGgCT^C^ 

TGATGTTTTG 

i 

_ACApATAGAC 

CTCCACCTAT 

AATTCTACAA 

300 

GGCCCAGCCA 

ACCAAACGCT 

GGCAGTGGAT 

GGTACAGCGT 

TACTGAAATG 

TAAAGCCACT 

360 

GGTGATCCTC 

TTCCTGTAAT 

TAGCTGGTTA 

AAGGAGGGAT 

TTACTTTTCC 

GGGTAGAGAT 

420 

CCAAGAGCAA 

CAATTCAAGA 

GCAAGGCACA 

CTGCAGATTA 

AGAATTTACG 

GATTTCTGAT 

480 

ACTGGCACTT 

ATACTTGTGT 

O^CTACAAGT 

TCAAGTGGAG 

AGGCTTCCTG 

GAGTGCAGTG 

540 

CTGGATGTGA 

CAGAGTCTGG 

AGCAACAATC 

AGTAAAAACT 

ATGA rr TTAAG 

TGACCTGCCA 

600 

GGGCCACCAT 

CCAAACCGCA 

AGTCACTGAT 

GTTACTAAGA 

ACAGTGTCAC 

CTTGTCCTGG 

660 

CAGCCAGGTA 

CCCCTGGAAC 

CCTTCCAGCA 

AG TG CAT AT A 

TCATTGAGGC 

TTTCAGCCAA 

720 

TCAGTGAGCA 

ACAGCTGGCA 

GACCGTGGCA 

AACC?ATGTAA 

AGACCACCCT 

CTATACTGTA 

780 

AGAGGACTGC 

GGCCCAATAC 

AATCTACTTA 

TTCATGGTCA 

GAGCGATCAA 

CCCCAAGGTY 

840 


B9S-006 


TCAGTGACCC 

AAGTNAAACC 

ACAGAAAAAC 

AATGGATCCA 

CTTGGGCCAA 

TGTCCCTCTA 

900 

CCTCCCCCCC 

CAGTCCAGCC 

CCTTCCTGGC 

ACGGAGCTGG 

AACACTATX3C 

. AGTGGAACAA 

960 

CAAGAAAATG 

GCTATGACAG 

TG AT AG CTGG 

TGCCCACCAT 

TGCCAGTACA 

AACTTACTTA 

1020 

CACCAAGGTC 

TGGAAGATGA 

ACTGGAAGAA 

GATGATGATA 

GGGTCCCAAC 

ACCTCCTGTT 

1080 

CGAGGCGTGG 

CTTCTTCTCC 

TGCTATCTCC 

TTTGG AC AG C 

AGTCCACTGC 

AACTCTTACT 

1140 

CCATCCCCAC 

GGGAAGAGAT 

GCAACCCATG 

CTGCAGGCTT 

CACCTNTTTA 

CCTCCTCTCA 

1200 

AAGACCTCGA 

CCTACCAGCC 

CATTTTCTAC 

TGACAGTAAC 

ACCAGTGCAG 

CCCTGAGTCA 

1260 

AAGTCAGAGG 

CCTCGGCCCA 

CTAAAAAACA 

CAAGGGAGGG 
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(2) INFORMATION FOR SEQ ID NO: 10: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 434 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 
(ix) FEATURE: 

(A) NAME/KEY: Modi f ied- s i te 

(B) LOCATION: 2 85.. 3 96 

(D) OTHER INFORMATION: /note= "Xaa signifies gap in sequence" 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Gin lie Val Ala Gin Gly Arg Thr Val Thr Phe Pro Cys Glu Thr Lys 

15 10 15 

Gly Asn Pro Gin Pro Ala Val Phe Trp Gin Lys Glu Gly Ser Gin Asn 

20 25 30 

Leu Leu Phe Pro Asn Gin Pro Glh Gin Pro Asn Ser Arg Cys Ser Val 

35 40 45 

Ser Pro Thr Gly Asp Leu Thr lie Thr Asn lie Gin Arg Ser Asp Ala 

50 55 60 

Gly Tyr Tyr lie Cys Gin Ala Leu Thr Val Ala Gly Ser lie Leu Ala 
65 70 75 80 

Lys Ala Gin Leu Glu Val Thr Asp Val Leu Thr Asp Arg Pro Pro Pro 

85 90 95 

lie lie Leu Gin Gly Pro Ala Asn Gin Thr Leu Ala Val Asp Gly Thr 

100 105 110 

Ala Leu Leu Lys Cys Lys Ala Thr Gly Asp Pro Leu Pro Val lie Ser 

115 120 125 

Trp Leu Lys Glu Gly Phe Thr Phe Pro Gly Arg Asp Pro Arg Ala Thr 


67 
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130 

He Gin Glu Gin 
145 

Thr Gly Thr Tyr 

Trp Ser Ala Val 
180 

Asn Tyr Asp Leu 
195 

Thr Asp Val Thr 
210 

Pro Gly Thr Leu 
225 

Ser Val Ser Asn 

Leu Tyr Thr Val 
260 

Val Arg Ala He 
275 

Lys Asn Asn Gly 
290 

Val Gin Pro Leu 
305 

Gin Glu Asn Gly 

Gin Thr Tyr Leu 
340 

Asp Arg Val Pro 
355 

lie Ser Phe Gly 
370 

Glu Glu Met Gin 
385 

Gin Arg Pro Arg 

Ala Ala Leu Ser 
420 

Gly Gly 


135 

Gly Thr Leu Gin 
150 

Thr Cys Val Ala 
165 

Leu Asp Val Thr 

Ser Asp Leu Pro 
200 

Lys Asn Ser Val 
215 

Pro Ala Ser Ala 
230 

Ser Trp Gin Thr 
245 

Arg Gly Leu Arg 

Asn Pro Lys Val 
280 

Ser Thr Trp Ala 
295 

Pro Gly Thr Glu 
310 

Tyr Asp Ser Asp 
325 

His Gin Gly Leu 

Thr Pro Pro Val 
360 

Gin Gin Ser Thr 
375 

Pro Met Leu Gin 
390 

Pro Thr Ser Pro 
405 

Gin Ser Gin Arg 


140 

He Lys Asn Leu 
155 

Thr Ser Ser Ser 
170 

Glu Ser Gly Ala 
185 

Gly Pro Pro Ser 

Thr Leu Ser Trp 
220 

Tyr He lie Glu 
235 

Val Ala Asn His 
250 

Pro Asn Thr He 
265 

Ser Val Thr Gin 

Asn Val Pro Leu 
300 

Leu Glu His Tyr 
315 

Ser Trp Cys Pro 
330 

Glu Asp Glu Leu 
345 

Arg Gly Val Ala 

Ala Thr Leu Thr 
380 

Ala Ser Pro Xaa 
395 

Phe Ser Thr Asp 
410 

Pro Arg Pro Thr 
425 


Arg He Ser Asp 
160 

Gly Glu Ala Ser_ 
175 

Thr He Ser Lys 
190 

Lys Pro Gin Val 
205 

Gin Pro Gly Thr 

Ala Phe Ser Gin 
240 

Val Lys Thr Thr 
255 

Tyr Leu Phe Met 
270 

Xaa Lys Pro Gin 
285 

Pro Pro Pro Pro 

Ala Val Glu Gin 
320 

Pro Leu Pro Val 
335 

Glu Glu Asp Asp 
350 

Ser Ser Pro Ala 
365 

Pro Ser Pro Arg 

Phe Thr Ser Ser 
400 

Ser Asn Thr Ser 
415 

Lys Lys His Lys 
430 
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(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 444 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GCCCAGGCAG TTGCTGCAGC TGCGGAGTAT GCGGGCCTGA AAGTGGCTCG CCGCCAAATG 6 0 

CAAGATGCTG CTGGCCGCCG CCACTTCCAT GCCTCTCAGT GCCCAAGGCC CACGAGTCCT 12 0 

GTGTCCACAG ACAGCAACAT GAGTGCTGTT GTGATC C AG A AAGCCAGACC CGCCAAGAAG 18 0 

C AG AAAC AC C AGCCAGGACA TCTGCGCAGG GAAGCCTACG CAGATGATCT TCCACCCCCT 24 0 

CCAGTGCCAC CACCTGCTAT AAAATCGCCC ACTGTCCAGT CCAAGGCACA GCTGGAGGTA 3 00 

CGGCCTGTCA TGGTGCCAAA ACTCGCGTCT ATAGAAGCAA GGACAGATAG ATCGTCAGAC 3 60 

AGAAAAGGAG GCAGTTACAA GGGGAGAGAA GCTCTGGATG GAAGACAAGT CACTGACCTG 42 0 

CGAACAAATC CAAGTGACCC CAGA 44 4 

(2) INFORMATION FOR SEQ ID NO : 12 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

Ala Gin Ala Val Ala Ala Ala Ala Glu Tyr Ala Gly Leu Lys Val Ala 

15 10 15 

Arg Arg Gin Met Gin Asp Ala Ala Gly Arg Arg His Phe His Ala Ser 

20 25 30 

Gin Cys Pro Arg Pro Thr Ser Pro Val Ser Thr Asp Ser Asn Met Ser 

35 40 45 

Ala Val Val lie Gin Lys Ala Arg Pro Ala Lys Lys Gin Lys His Gin 

50 55 60 

Pro Gly His Leu Arg Arg Glu Ala Tyr Ala Asp Asp Leu Pro Pro Pro 
65 70 75 80 

Pro Val Pro Pro Pro Ala lie Lys Ser Pro Thr Val Gin Ser Lys Ala 
85 90 95 
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Gin Leu Glu Val 
100 

Ala Arg Thr Asp 
115 

Arg Glu Ala Leu 
130 

Ser Asp Pro Arg 
145 


Arg Pro Val Met 

Arg Ser Ser Asp 
120 

Asp Gly Arg Gin 
135 


Val Pro Lys Leu 
105 

Arg Lys Gly Gly 

Val Thr Asp Leu 
140 


Ala Ser lie Glu 
110 

Ser Tyr Lys Gly 
125 

Arg Thr Asn Pro 


70 
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